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ABSTRACT 

A  grammar  G  is  context  limited  if  there  exists  a  partial  ordering  on  the 
alphabet  of  G  under  which,  far  every  production  o-  -»  9  of  G,  every  letter  of  0 
is  smaller  than  some  letter  of  8.  It  is  proved  that  the  languages  generated 
by  context  limited  grammars  are  Just  the  context  free  languages.  Unambiguity 
of  general  grammars  is  defined  and  discussed  carefully,  preparatory  to  proving 
that  the  language 6  generated  by  unambiguous  context  limited  grammars  cure  Just 
the  unambiguous  context  free  languages. 
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THE  EQUIVALENCE  OF  CONTEXT  LIMITED  GRAMMARS 
TO  CONTEXT  FREE  GRAMMARS^) 

A  phrase  structure  grammar  [3,  p.8]  is  here  called  context  limited  if 
there  exists  a  partial  ordering  on  the  alphabet  of  the  grammar,  under  which, 
for  every  production  <v  —  B  of  the  grammar,  every  letter  of  or  is  strictly 
smaller  than  6ome  letter  of  0. 

We  will  prove 

(1)  that  the  context  limited  grammars  generate  all  and  only  context 
free  languages;  and 

(2)  that  the  unambiguous  context  limited  grammars  generate  all  and 
only  unambiguous  context  free  languages. 

The  bulk  of  the  effort  goes  to  (2)  because,  although  the  intuitive  notion 

of  unambiguity  is  clear  enough,  it  has  not  previously  been  defined  formally 

(2) 

for  phrase  structure  grammars  in  general.  The  notion  of  the  ambiguity  of 
a  grammar  is  Just  that  there  be  two  essentially  different  ways  of  generating 
the  same  word.  Section  2  is  devoted  to  formalizing  this  term  "essentially 


Research  sponsored  in  part  by  the  Air  Force  Cambridge  Research  Laboratories, 
Office  of  Aerospace  Research,  USAF,  under  Contract  FI962867OOOO8,  and  by  the 
Air  Force  Office  of  Scientific  Research,  Office  of  Aerospace  Research,  USAF, 
under  AFOSR  Grant  No.  AF-AF0SR-12O3-67. 

(2) 

'  'This  is  because  there  has  been  no  need  for  such  a  definition.  With  respect 
to  the  most  general  class  of  phrase  structure  grammars,  there  are  no  inherently 
ambiguous  grammars,  and  it  is  not  yet  known  whether  the  corresponding  assertion 
for  context  sensitive  grammars  holds  or  not. 
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different, "  to  establishing  a  few  basic  facte  about  the  formulation,  including 
the  fact  that  it  reduces  to  the  usual  one  in  the  context  free  case,  and  to 
prowing  ease  technical  lemmas  far  use  in  Section  3* 

In  Section  3  the  context  limited  grammars  are  defined  formally  and  their 
two  main  properties,  indicated  above,  are  proved  (Theorems  1  and  2). 

Section  1  explains  the  notation. 

Most  of  the  material  presented  here  appeared  first  in  the  author's 
doctoral  dissei’tation  [lO]. 
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1.  Notation 

The  notation  is  drawn  mainly  from  [3],  with  the  exceptions  now  to  be 
noted . 

Grammar  will  mean  phrase  structure  grammar  [3,  p.8]. 

Let  G  =  (V,E,P, a)  be  a  grammar.  We  will  refer  to  the  letters  in  E  as 

external  letters  (to  avoid  conflict  with  another  natural  use  of  the  term 

"terminal")  and  to  those  in  V-E  as  internal  letters. 

ry  I—  8  will  be  used  instead  of  »  =»  0. 

G  G 

A  complete  O-derivatioc  is  a  G-derivation  beginning  with  a  and  ending  with 
* 

word  in  E  . 


o  is  the  initial  or  start  letter  of  G. 
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2.  G- graphs  and  Ambiguity 

The  first  objective  is  to  develop  the  notation  for  talking  about  ambiguity, 
to  establish  a  few  simple  facts  about  it,  and  to  set  forth  the  definition  of 
ambiguity. 

The  definition  of  ambiguity  to  be  given  is  equivalent  to  the  one  implicit 
in  Griffiths'  paper  [9].  However,  using  Griffiths'  definition  would  present 
as  much  of  a  problem  in  establishing  fundamental  properties  as  does  the  present 
one.  Moreover,  we  feel  that  the  present  construction  embodies  in  a  more  direct 
manner  the  intuitive  notion  of  ambiguity;  far,  whereas  Griffiths  chooses  a 
canonical  derivation  from  a  set  of  equivalent  derivations,  we  are  going  to 
deal  with  a  structure  possessed  in  common  by  all  the  derivations  in  an 
equivalence  class. 

The  basic  entities  are  the  production  graph  and  the  labeled  production 
graph.  The  notion  of  a  production  graph  is  represented  accurately  by  strings 
of  beads  hung  together  as  in  Figure  1.  The  numbers  beside  the  beads  are 
only  for  reference  and  do  not  represent  port  of  the  graph.  The  string 
(l,2,3>^,5>6)  Is  the  initial  string  of  the  graph.  The  string  (1,7,8,10,15,16,14) 
is  the  terminal  string,  for  these  are  the  exposed  beads  in  their  proper 
left-to-right  order,  all  other  beads  having  been  "covered"  by  hanging  a  new 
string  across  them.  If  we  wanted  to  specify  this  graph  without  drawing  a 
figure,  we  could  use  a  notation  such  as  the  following: 
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Initial  string:  1,2, 3, 4, 5, 6 

Cover  2,3  by  7,8,9 
5,6  by  12,13 
9,4  by  10,11 
12,13  by  14 
11  by  15,16 


Thus  a  production  graph  is  specified  by  specifying  an  initial  string  and  a  set 
of  ordered  pairs  of  strings.  Note  that  not  every  set  of  ordered  pairs  of  strings 
makes  sense:  every  first  string  must  at  some  stage  of  the  construction  be  a 
segment  of  the  terminal  string  at  that  stage,  and  at  the  same  stage  the 
accompanying  second  string  must  consist  of  newly  introduced  beads.  These 
conditions  would  be  taken  care  of  automatically  in  the  process  of  actually 
hanging  the  beads,  but  must  be  made  explicit  in  the  formal  definition. 

The  beads  will,  be  called  "nodes";  the  ordered  pairs  of  strings  will  be 
called  "productions." 

By  associating  a  letter  of  some  alphabet  with  each  node  of  a  production 
graph  we  construct  a  labeled  production  graph.  If  G  is  a  grammar  and  T  is  a 
labeled  production  graph,  if  each  production  of  T  farms  a  production  of  G 
when  the  labels  replace  the  node  nam'.s,  and  if  every  label  is  in  the  alphabet 
of  G,  then  T  is  a  "G-graph."  Every  G-graph  represents  at  least  one  (possibly 
mare)  G-derivation.  If  0-graphs  representing  complete  derivations  are 
unique  (up  to  congruence)  far  each  derived  word,  then  G  will  be  called 
unambiguous. 

The  remainder  of  this  section  is  devoted  to  giving  precise  formulations 
far  the  above  notions  and  making  sure  that  these  formulations  have  the 
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properties  we  want.  We  will  first  confine  our  attention  to  production  graphs, 
then  consider  labeled  graphs  and  G-graphs. 

2.1.  Production  graphs. 

By  a  string  we  will  mean  a  (possibly  null)  finite  nonrepeating  sequence 
of  elements  of  some  set.  If  t  and  a  are  strings,  then  to  will  denote  the 
concatenation  of  the  two.  Of  course  To  need  not  be  a  string  Just  because  t 
and  o  are,  but  we  adopt  the  convention  shat  when  we  write  to  we  mean  it  to 
be  a  string. 

a  is  a  substring  of  T  if  t  «  T1CTT2* 

A  (graphical)  production  is  an  ordered  pair  (o, t)  where  o  and  r  are 
strings  with  no  element  in  common  and  neither  a  nor  r  is  null. 

If  4  is  a  set  of  productions,  thereby  the  nodes  of  4,  we  mean  the 
elements  in  the  strings  in  the  productions  in  4. 

The  term  "production  graph"  will  denote  a  structure  associated  wi  each 
of  two  derivations  when  the  two  derivations  are  "essentially  the  same."  A 
production  graph  will  consist  simply  of  a  set  of  productions  together  with  an 
"initial  string,"  which  together  have  certain  properties.  These  properties 
are  most  easily  expressed  by  an  inductive  definition. 

In  the  definition  of  production  graph,  given  next,  it  will  be  convenient 
to  include  the  definition  of  the  terminal  string  of  the  production  graph,  so 
that  we  will  actually  be  defining  an  ordered  triple,  "a  production  graph  (4,n) 
with  terminal  string  t."  It  will  be  shown  shortly  that  4  and  p  entail  a 
unique  t  so  that  production  graphs  :an  be  given  as  ordered  pairs. 


Let  0  denote  the  empty  set. 
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DEFINITION. 

PG1.  If  u  is  a  string,  then  (0,u)  is  a  production  graph  with 
terminal  string  pi. 

PG2.  If  ($,u)  is  a  production  graph  with  terminal  string 

TlT2Ty  and  P  ie  a  string  containing  no  node  of  1  and 
no  node  of  u>then  (4  U  fiV,,p)},p)  is  a  production  graph 
with  terminal  string  p  Xy 

PG3-  (i,u)  is  a  production  graph  with  terminal  string  t  only 
if  required  by  PG1  and  PG2. 

If  (4,u)  is  a  production  graph,  then  we  call  u  the  initial  string  of 

If  ($,U)  and  p  are  as  in  PG2,  and  if  T  denotes  ($,u),  then  T  +  (t,,p) 
denotes  the  production  graph  ($  U  { (  tv  ,  p) },  u) . 

We  are  going  to  rely  heavily  on  certain  properties  of  production  graphs. 
All  of  these  properties  are  Just  what  we  would  expect  them  to  be,  but  Just 
elusive  enough  to  require  proof. 

Our  first  task  is  to  show  that  every  production  graph  has  a  unique 
terminal  string.  Wt  anticipate  this  proof  in  the  following  definition: 

A  terminal  nrjc  of  a  production  graph  is  a  node  belonging  to  the  terminal 
string  of  the  production  graph. 

For  the  time  being,  read  "a  terminal  Btring"  for  "the  terminal  string"  in 


the  previous  definition 


lo  January  lyt>7 


11 


TM-73«/0 30/00 


PROPOSITION  2.1.1.  Let  T  be  a  production  graph.  A  node  p  of  T  is  a  terminal 
node  of  T  if  and  only  if  p  does  not  occur  in  the  first  string  of  any  production 
of  T.  Every  terminal  node  of  T  occurs  in  every  terminal  string  of  T. 

Proof.  The  assertion  is  obvious  for  graphs  with  no  productions.  Suppose  the 
assertion  holds  for  graphs  with  a  productions,  aid  let  T  be  a  graph  with  n  +  1 
productions.  Let  p  occ’ir  in  a  terminal  string  of  T.  Then  there  exists  a 
graph  Tq  with  n  productions  and  terminal  string  p^crp^,  and  a  production  (a, t) 
such  that  T  *  Tq  +  (a,  t)^  and  such  that  p  is  in  PpTP2*  P  is  in  t  then  p 

is  not  in  o  and  p  is  not  a  node  of  Tq  (far  all  the  nodes  of  t  must  be  foreign 
to  Tq  in  order  to  apply  PG2) .  Hence  p  does  not  occur  in  the  first  string  of 
any  production  of  T.  If  p  is  in  or  then  p  is  a  terminal  node  of  Tq, 
hence  by  inductive  hypothesis  occurs  in  the  first  string  of  no  production  of 
Tq,  and  p  is  not  in  a  because  P^op^  is  a  string.  Thus  p  occurs  in  the  first 
string  of  no  production  of  T. 

Conversely,  suppose  p  occurs  in  the  first  string  of  no  production  of  T. 

A  graph  with  no  productions  can  only  cane  via  PG1,  where  it  is  clear  that 
every  node  is  in  the  one  terminal  string.  Suppose  it  is  true  in  graphs  with  n 
productions  that  every  node  such  as  p  belongs  to  every  terminal  string.  Let  T 
be  s  graph  with  n  +  1  productions.  Any  terminal  string  of  T  is  given  by  a 
graph  Tc  with  terminal  string  P-^op^  and  a  production  {a,  t),  with 
T  -  Tq  +  (a, t),  where  T,  has  n  productions.  If  p  is  in  t  then  p  occurs  in  the 
terminal  string.  If  p  Is  not  in  t  then  p  is  a  node  of  r*.  .  Every  production 
of  T  is  a  production  of  T,  so  p  occurs  in  the  first  string  of  no  production 
of  T  .  Hence  p  occurs  in  every  terminal  string  of  T(),  in  particular  in 
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p^cjp0 .  p  is  not  in  a  far  then  p  would  occur  in  the  first  string  of  (0,  *r). 
Hence  p  is  in  or  p 2,  hence  in 

PROPOSITION  2.1.2.  No  node  of  a  production  graph  T  occurs  in  the  second  string 
of  more  than  one  production  of  T. 

Proof.  The  assertion  holds  if  there  are  no  productions.  Assume  the  assertion 
holds  for  all  graphs  with  n  productions.  Let  T  ■  Tq  +  (a, t)  where  Tq  has  n 
productions.  The  nodes  of  t  are  foreign  to  Tq  so  no  node  of  r  can  occur  in 
the  second  string  of  two  distinct  productions  of  T.  But  far  any  other  node 
to  do  so  would  require  it  to  do  so  in  Tq,  contrary  to  the  induction  hypothesis. 

PROPOSITION  2.1.3*  Let  T  be  a  production  graph  and  (a,  t)  a  production  of  T 
such  that  all  the  nodes  of  t  are  terminal  nodes  of  T.  Then  t  is  a  substring 
of  every  terminal  string  of  T,  and  for  every  terminal  string  p1tp2  of  T 
there  exists  a  unique  production  graph  T#  with  terminal  string  P-^oPg  such 
that  T  -  T'  +  (ct,t). 

Notation.  The  unique  T#  6uch  that  T  -  T*  +  (a,  t)  will  be  denoted  T  -  (o,  t). 

Proof.  The  assertion  is  vacuously  true  for  graphs  with  no  productions.  Assume 

it  holds  for  graphs  with  n  productions.  Let  T  have  n  ♦  1  productions  and 

satisfy  the  hypothesis  with  respect  to  a,r.  Let  X  be  a  terminal  string  of  T. 

There  exists  a  graph  T  with  terminal  string  U  L  such  that  T  ■  T  +  (o  .t  ) 

o  1  o  2  o  o  o 

and  such  that  X^t  "X.  If  (oQ, tq)  -  (a, t),  then  indeed  t  is  a  substring 
of  X.  If  (<Jo>T0)  ^  (°;Tl  then, by  2.1.2,  tq  Is  disjoint  iron  T,  and  (o,t)  is  a 


lb  Jamaary  1957 


13 


TM- 735/030/00 


production  of  Tq.  By  2.1.1,  since  every  production  of  Tq  is  a  production  of  T, 

every  node  of  r  is  a  terminal  node  of  Tq.  By  inductive  hypothesis  then,  t  is  a 

substring  of  X  a  X_.  t  is  disjoint  from  o  .  by  2.1.1.  Hence  t  is  a  substring 
1  o  2  ~ 

either  of  X^  or  of  >  ,.  In  either  case  t  i6  a  substring  of  X. 

To  construct  T 7  now,  let  us  assume  t  1b  a  substring  of  X^  and  observe  as 

ve  go  that  the  proof  is  easily  adaptable  to  X^.  So  let  X^  ■  X^tX^.  By 

inductive  hypothesis  there  exists  a  graph  With  terminal  string  ^cr^j_a0^ 

such  that  T  *  T7  +  (a,  t).  We  let  T7  «=  T7  +  (a  ,  t  )  according  to  PG2,  and  note 
00  o  o  o 

that  T7  has  terminal  string  X^cX^t^X^.  ^0»  Qnd  T7  were  all  derived  according 
to  PG2,  so  they  all  have  the  same  initial  string  as  T.  T7  lacks  only  one 
production  of  T,  namely  (a, t).  t'  +  (o,t)  is  a  production  grapn  according  to 
PG2,  and  has  tne  same  productions  and  initial  string  as  T.  That  is, 

T '  +  (ct,  t)  ■  T.  The  terminal  string  obtained  frau  this  construction  is 
X^tX^tqX^  =  \To^2  =  So  ^he  terminal  6tring  \a^iTo^2  iB  the  one  re(lulre^’ 

The  uniqueness  of  T7  is  clear,  for  a  production  graph  is  completely 
specified  by  its  productions  and  Initial  string. 

PROPOSITION  2.1.4.  A  production  graph  has  Just  "W  terminal  string. 


Proof.  The  assertion  holds  for  graphs  with  no  productions.  Assume  it  holds 
for  graphs  with  n  productions  and  let  T  be  a  graph  with  n  +  1  productions. 


Let  \Q  be  a  terminal  string  of  T.  Then  there  is  a  graph  Tq  with  terminal 

string  p.  o  p~,  and  a  production  (a  ,  t  )  of  T,  such  that  T  ■  T  +  (o  ,t  ) 

X  O  u  o  o  o  o  o 

Xq  ■  pxt0^2*  Similarly,  if  we  let  X^  be  any  other  terminal  string  of  T  we 
have  T  -  Tx  +  where  T^  has  terminal  string  P^pJ  and  X^  »  P1T1  P2‘ 


and 
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By  2.1.3,  is  a  substring  of  P1T0P2*  ^  (^0,T0)  ■  (°pTi)  then  Tq  -  T^,  for 

Tq  and  have  the  same  initial  string  as  T  and  would  then  have  the  same 

productions.  By  inductive  hypothesis  we  could  then  say  P^^Pp  ■  P^a.  Pg, 

whence,  since  oQ  ■  p^  ■  and  p?  ■  pg.  Thus,  since  tq  -  t^, 

P,  T  P  -  p  7t  p '  or  X  ■  L  . 

1  O  2  112  O  1 

If  (^0,T0)  ^  ( °i , Ti ),  then  tq  and  are  disjoint  according  to  2.1.2.  So 
t  is  a  substring  of  either  p^  or  Pg.  There  is  no  loss  of  generality  in 
choosing  Pp  so  that  p1  -  111611  “  ^iti^2ToP2*  By  2,1‘3  there 

exists  a  production  graph  T7  with  terminal  string  ^^W^TqPjj  such  that 
T  =  T  7  +  (opT^).  But  T7  must  therefore  have  the  same  productions  and  initial 
string  as  T^,  that  is,  T7  ■  T^.  By  -inductive  hypothesis  P^o^Tq^  "  P1°1P2* 
Therefore  -  p^  and  ^2ToP2  "  P2*  NoW 

\  -  elV2  -  Wo*!  -  Vop2  -  V 

We  have  thus  taken  two  arbitrary  terminal  strings  of  T  and  found  that  they  are 
the  same. 

The  duality  principle  established  next  (2.1.6)  will  be  used  quite 
frequently. 

PROPOSITION  2.1.5.  If  T  -  is  a  production  graph  and  o  is  a  string 

with  no  nodes  in  common  with  then  (4  U  f (a,Ug) is  a  production  graph 
with  the  same  terminal  string  as  T. 

Notation.  (4  U  ((a,Ug)  will  be  denoted  by  (o,ii g)  +  T. 
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Proof.  If  $  is  empty,  vie  get  the  assertion  by  applying  PG2  to  ($,4^04^).  For 
the  induction  step  on  T  +  (r,  p),  we  form  (c,42)  +  T  by  inductive  hypothesis, 
note  that  it  has  the  same  terminal  string  as  T  and  has  the  initial  string  of 
the  graph  we  want  to  construct,  and  then  form  ((o>U5)  +  T)  +  (t,  p)  by  PG2. 

PROPOSITION  2.1.6.  (Duality.)  Let  ($,4)  be  a  production  graph  with  terminal 
string  r.  Let  =  {(Pg,  P^)  :  (Pj,Pg)  in  Then  (^t)  j.6  a  production 
graph  with  terminal  string  u. 

Notation.  (^r)  will  be  called  the  dual  of  ($,4). 

Proof.  If  i  is  empty,  then  4  ■  t  and  the  assertion  holds.  For  the  induction 
step  on  T  +  (p^,  Pg),  form  the  dual  of  T  by  inductive  hypothesis,  note  that  its 
initial  string  is  the  same  as  the  terminal  string  of  T,  so  that  (PgjP^)  can  be 
adjoined  according  to  2.1.5* 

The  duality  principle  2.1.6  gives  us  dual  propositions  to  2.1.1,  2.1.2, 
2.1.3.  For  example,  the  dual  of  2.1.1  asserts  that  a  node  is  an  initial  node 
if  and  only  if  it  does  not  occur  in  the  second  string  of  any  production.  The 
dual  of  2.1.4  asserts  that  any  two  production  graphs  with  the  same  productions 
and  the  same  terminal  string  must  have  the  same  initial  string,  i.e.,  must  be 
equal. 

The  next  objective  is  to  formulate  some  rules  for  making  changes  internally 
in  a  graph  so  that  the  resulting  entity  still  is  a  production  graph.  These 
changes  will  be  expressed  in  terms  of  the  removal  and  replacement  of  "subgraphs." 


16  January  1967 


16 


TM-738/030/oo 


By  an  initial  subgraph  of  a  production  graph  T  ■  ($,u),  we  mean  a  production 
graph  such  that  i '  c  $,  An  intermediate  string  of  T  is  a  terminal  string 

of  an  initial  subgraph  of  T.  A  string  of  T  is  a  substring  of  an  intermediate 
string.  By  the  terms  initial  or  terminal  substring  of  T,  we  mean  respectively 
a  substring  of  the  initial  or  terminal  string  of  T. 

Notice  that  if  is  an  initial  subgraph  of  T2  and  T,.  is  an  Initial  sub¬ 
graph  of  T^,  then  is  an  initial  subgraph  of  T^.  Consequently,  if  is  an 
initial  subgraph  of  T2,then  any  intermediate  string  of  T^  is  an  intermediate 
string  of  T2* 

We  call  a  string,  production,  or  graph  disjoint  from  another  string, 
production,  or  graph  if  the  two  entities  have  no  nodes  in  common. 

PROPOSITION  2.1.7*  i*?t  bo  an  initial  subgraph  of  a  production  graph  T, 
let  a  be  a  terminal  substring  of  T^,  and  let  (o,  t)  be  a  production  of  T. 

Then  t  is  disjoint  from  and  T^  +  (a, t)  is  an  initial  subgraph  of  T. 

Proof.  We  have  only  to  prove  that  no  node  of  t  is  a  node  of  T^,  for  then,  by 
PG2,  T^  +  (o,  t)  is  a  production  graph  with  the  same  initial  string  as  T^,  hence 
as  T,  and  all  of  the  productions  of  T^  +  (a, t)  belong  to  T. 

(o,t)  is  not  a  production  of  T^,  by  2.1.1.  If  (0^,1^)  is  any  production 
of  Tj,  then, by  2.1.2  applied  to  T,  and  t  are  disjoint.  Hence  by  the  dual 
of  2.1.1,  t  can  contain,  at  most,  initial  nodes  of  T^.  But  the  initial  nodes 
of  T^  are  Just  the  initial  nodes  of  T,  and  none  of  these  can  be  in  t  by  the 


dual  of  2.1.1. 
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PROPOSITION  2.1.8.  Let  be  an  initial  subgraph  of  a  production  graph  T, 
with  ^  T.  There  exists  a  terminal  substring  a  of  T^  and  a  production  (a,r) 
of  T  such  that  T^  +  (a, t)  is  an  initial  subgraph  of  T. 

Proof.  By  induction  on  the  number  of  productions  of  T.  If  T  has  no  productions, 
then  T  has  no  proper  initial  subgraph,  so  the  assertion  is  true  vacuously. 

Suppose  the  assertion  holds  for  graphs  with  n  productions,  and  let 
T  =  T7  +  (a, t)  have  n  +  1  productions.  Let  T^  be  an  initial  subgraph  of  T. 

If  (a,  t)  is  not  a  production  of  Tp  then  is  an  initial  subgraph  of  T#.  If 
T^  ■  T*  then  the  required  terminal  substring  and  production  are  o  and  (o,  t). 

If  T*  4  Tp  then  the  required  production  is  obtained  from  the  inductive 
hypothesis. 

Suppose  (o,  t)  is  a  production  of  Tp  Then  it  is  a  terminal  production  of 
Tp  by  2.1.1.  T^  =  T2  +  (a,  t)  for  same  Tg,  by  2.1.3*  is  a  proper  initial 
subgraph  of  T*  lest  T^  be  equal  to  T.  By  inductive  hypothesis  there  is  a 
terminal  substring  o^  of  T^  and  a  production  (f-TpT1)  of  T^  such  that  T  +  (opT^) 
is  an  initial  subgraph  of  T ' .  is  disjoint  from  c,  for  otherwise  we  would 
have,  by  the  dual  of  2.1.2,  that  (crp  t^)  =  (a,  t),  but  (a^,^)  is  in  T '  and  (ct,  t) 
is  not.  Therefore  o^  is  a  terminal  substring  of  T  +  (a, t)  o  Tp  By  2.1.7, 

T-^  +  (apT^)  is  an  initial  subgraph  of  T. 

PRi  OS  IT  ION  2.I.9.  Let  T  be  a  production  graph  and  a  a  non-null  string.  If 
and  oa2  are  both  strings  of  T,  then  a^aCg  is  a  string  of  T. 
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Proof*  It  will  suffice  to  consider  Just  the  case  where  a  has  length  1,  for 
the  general  case  can  be  reduced  to  this  by  considering  the  rightmost  node  q 
of  o  and  the  two  strings  o^o  and  qag. 

So  we  will  consider  strings  OjP  and  p<?2,  with  p  a  node  of  T.  There  exists 
an  initial  subgraph  of  T  with  n^  productions,  and  another,  T 2,  with  rig 
productions,  containing  OjP  and  pa2  respectively  as  terminal  substrings.  The 
proof  Is  by  induction  on  +  iig. 

If  ni  ♦  Op  «  0,then  o^p  and  pa2  are  initial  substrings  of  T,  whence  a^pcg 
is  an  initial  substring  of  T,  hence  a  string  of  T. 

Suppose  the  assertion  holds  for  all  cases  where  +  n2  *  n,and  consider 
a  case  where  n^  +  rig  ■  n  +  1. 

Case  1.  Suppose  T^  contains  a  terminal  production  (T1^T2)  where  T2  does  ncrt 
contain  p.  If  Tg  is  also  disjoint  from  c^,  then  has  terminal 

substring  a^p  and  only  n  -  1  productions,  so  that  the  inductive  hypothesis  is 
directly  applicable.  Suppose  is  not  disjoint  from  o^.  Then,  since  p  is  not 
in  t^,  T^  has  a  terminal  substring  Pi^PjjP  vtiere  °i  *8  a  rightmost  substring 
of  Hence  -  (Ti,  T2)  has  a  terminal  substring  P^^P^* 

inductive  hypothesis  tells  us  that  p^T^PgPOg  is  a  string  of  T.  Let  Tq  be  an 
initial  subgraph  of  T  containing  ae  a  terminal  substring.  By  2.1.7, 

TQ  +  (t^, Tg)  iB  ^  iHl^ial  subgraph  of  T.  T  +  (t^,  Tg)  contains  p^TgPgPOg. 
beiiig  a  rightmost  substring  of  P^T2P2>  alpa2  muflt  a  s-trl>nS  of 

Case  2.  Suppose  T,  contains  a  terminal  production  (t^,  T2)  wbere  T2  does  n°t 
contain  p.  The  argument  of  Case  1  is  adaptable  by  symmetry  to  this  case. 
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Nov  either  T.  or  T,  has  at  least  one  production,  and  by  the  symmetry  of 
the  situation  ne  may  as  well  assume  that  has  a  production.  Then  has  a 
terminal  production  By  Case  1,  we  assume  that  contains  p. 

By  the  dual  of  2.1.1,  p  Is  not  an  initial  node  of  T,  hence  not  an  initial 
rode  of  T, .  By  the  dual  of  2.1.1  and  by  2.1.2,  we  learn  that  (t^,  t  ,)  is  a 
production  of  T  .  Suppose  (t  ,0  iB  not  a  terminal  production  of  T  .  Then, 
since  T2  has  a  terminal  production,  by  2.1.2,  T^  must  have  a  terminal 
production  (t^,  t/)  where  does  not  contain  p.  But  this  is  Case  2.  So  we 
can  assume  is  terminal  in  Tg. 

Now  if  t2  contains  all  of  a j,  then  o^p  is  a  substring  of  t^,  hence  a 
terminal  substring  of  T^.  This  would  imply  that  o^pa^  is  a  terminal  substring 
of  T2,  i.e.,  that  a^p^  is  a  string  of  T.  So  we  can  assume  that  does  not 
contain  all  of  o^.  Similarly,  we  can  assume  that  t 2  does  not  contain  all  of  a^. 

We  conclude  that  there  must  exist  strings  o^  and  o^  such  that 
°lT2a2  “  ai^a2>  aiT2  a  substring  of  T^  and  a  terminal  sub¬ 
string  of  T2«  Therefore  is  a  terminal  substring  of  T,  -  (t^,t2).  By 

inductive  hypothesis,  ^T^a2  is  a  string  of  T,  hence  occurs  as  a  terminal 
substring  of  sane  initial  subgraph  Tq  of  T.  Tq  +  (t^,t2)  ls  ^ 
subgraph  of  T,by  2.1.7,  and  contains  the  terminal  substring  0-^2° 2  *  a^pa2, 
i.e.,  a^pag  is  a  string  of  T.  This  finishes  the  proof. 

PROPOSITION  2.1.10.  If  ( 4 , n)  is  a  production  graph  with  terminal  string  t, 
and  if  ^  and  n  are  strings  disjoint  from  (4,p-),  then  (4,p^y42)  is  a  production 
graph  with  terminal  string  .  Conversely,  if  (i,y^yy  )  is  a  production 
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graph  with  terminal  string  ^Ty^then  (4,^)  is  a  production  graph  with  terminal 
string  t. 

Notation.  If  T  denotes  (4,u),  then  (i^uug)  will  be  denoted  by  p^Tpg. 

Proof.  The  conclusion  is  direct  from  PG1  if  4  is  empty.  If  the  assertion  holds 
for  graphs  with  n  productions  and  T  has  n  +  1  productions,  choose  a  terminal 
production  (c^Og)  of  T  and  form  ^(T  -  (o^Cg))^  +  (cr^Cg).  This  is  the 
required  graph. 

For  the  converse,  the  assertion  is  trivial  if  4  is  empty.  Suppose  the 
converse  holds  for  all  graphs  with  n  productions  and  let  T  +  (c^Og)  have 
n  +  1  productions,  where  T  +  (a^Og)  has  initial  string  and  terminal 

string  u^tu  (.  By  the  dual  of  2.1.1,  Og  contains  no  node  of  the  initial 
substrings  Clearly,  therefore,  T  has  terminal  string  u^T'ug  for  some  t'. 

By  inductive  hypothesis,  T  -  for  some  graph  U.  Clearly  must  be  a 

terminal  substring  of  U,  hence  T  +  (a^Og)  -  u^U  +  (a^CTgJJug.  U  +  (a^Og) 
is  the  required  graph. 

PROPOSITION  2.1.11.  Let  (4,u)  be  a  production  graph  with  terminal  string 
Let  (4^,t)  be  a  production  graph  with  terminal  string  o,  and  with 
only  the  nodes  of  t  in  common  with  (4,u).  Ihen  (4  U  4^,u)  is  a  production 
graph  with  terminal  string 

Notation.  If  (4,u)  -  T  and  (4^,t)  ■  T^  then  (4  U  4^,n)  will  be  denoted  by 
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Proof.  By  induction  on  the  size  of  The  case  where  4^  is  empty  is  trivial. 

To  form  T  +  (T^  +  (p^,  P0))  in  the  induction  step,  merely  note  that  is  a 
terminal  substring  of  T  +  T^,  P2  is  disjoint  from  it,  hence  (T  +  T^)  +  (p1,  P2) 
is  a  production  graph  and  has  the  properties  required  for  T  +  (T^  +  (p^p  )). 

Let  us  now  define  a  subgraph  of  a  production  graph  T  ■  ($,p,)  to  be  a 
production  graph  ($',0)  such  that  4  '  c  4  and  a  is  a  string  of  T. 

Note  that  it  is  essential  to  have  the  initial  string  be  a  string  of  T, 
that  is,  occur  in  the  terminal  string  of  some  initial  subgraph.  For,  the 
idea  of  a  subgraph  is  that  it  can  be  viewed  as  an  integral  component  of  T.  It 
can  happen  that  T  contains  a  set  of  productions  which  form  a  production  graph 
but  not  a  subgraph  of  T. 

The  next  proposition  generalizes  2.1.7,  with  a  subgraph  in  the  role  of  the 
production. 

PROPOSITION  2.1.12.  Let  T^  be  an  initial  subgraph  and  a  subgraph  of  the 
production  graph  T.  If  the  initial  string  of  T^  is  terminal  substring  of  T^, 
then  and  T^  have  no  nodes  in  common  other  than  the  initial  nodes  of  T,  . 

Thus  +  T,  exists  and  is  an  Initial  subgraph  of  T. 

Proof.  By  induction  on  the  number  of  productions  in  T2»  The  assertion  is 
trivial  when  T^  has  no  productions.  Suppose  the  assertion  holds  in  all  cases 
where  T2  has  n  productions  and  consider  a  case  where  T2  has  n  +  1  productions. 
Then  T,  -  t/  ♦  (a, t)  for  some  terminal  production  (a, t)  of  Tg.  a  is  a  terminal 
substring  of  tJ.  T2  has  the  same  initial  string  as  T2,  and  has  only  n 
productions.  Therefore  T2  has  no  nodes  in  common  with  other  than  initial 
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nodes,  and  +  t/  is  an  initial  subgraph  of  T.  c  is  a  terminal  substring  of 
T1  +  Tj>by  2*1*11*  So, by  2.1.7,  t  is  disjoint  from  +  tJ  and  (T^  +  Tp  +  (a, t) 
is  an  initial  subgraph  of  T.  The  productions  and  initial  word  of  +  (T;  +  (o,  t)) 
are  the  same  as  those  of  (T^  +  t/)  +  (o,  t),  so  the  inauction  is  continued  to  n  +  1. 

PROPOSITION  2.1.13.  Let  (4,p.)  be  a  production  graph  with  terminal  string  t^tt  . 

If  (i^,o)  is  a  subgraph  of  (4,u)  with  terminal  string  t,  then  (4  -  Spu)  iB  a 
production  graph  with  terminal  string  T1^T2* 

Notation.  If  T  *=  (4, 4)  and  T^  -  (4^,  a),  then  (4  -  4^,u)  will  be  denoted  by 
T  -  Tj,  .  Also,  in  the  case  where  the  initial  string  o  of  T^  is  a  substring  of 
the  initial  6tring  of  T,  with  no  hypothesis  on  tne  terminal  string  t  of 

Tp  the  graph  (4  -  4^^,^^)^  which  is  obtained  from  the  dual  of  2.1.13,  will 
also  be  denoted  by  T  -  T^. 

Proof.  By  induction  on  the  size  of  4^.  If  4^  is  empty,  then  the  assertion  is 
trivial.  Suppose  the  assertion  holds  in  all  cases  where  T^  •  (4^,0)  has  n 
productions,  and  consider  a  case  where  T^  has  n  +  1  productions.  Let  (p^, p^) 
be  a  terminal  production  of  T^.  Let  T  **  (4,^).  To  assert  that  the  required 
production  graph  it  \T  -  (p  1^2})  "  "  (Pi^Pg))  we  need  only  show  that 

T^  -  (p^>P2)  is  R  subgraph  of  T  -  (p^,  Pg),  for  clearly  the  terminal  string  of 
Ti  "  ^  plf  p2^  is  a  tcnainal  substring  of  T  -  (p^,  P2).  The  initial  string  a 
of  T^  is  also  the  initial  string  of  T^  -  ( Pg).  Let  Tq  be  an  initial  subgraph 
of  T  containing  o  as  a  terminal  substring.  Tq  could  not  contain  ( p^,  p^) 
because  of  2.1.12.  Thus  every  production  of  Tq  is  a  production  of  T  -  (p^p^). 
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and  T  has  the  same  initial  string  as  T  and  T  -  ( 0, ,  p  ),  so  T  is  un  initial 
subgraph  of  T  -  (o^,p^).  Hence  a  is  a  6tring  of  T  -  (p  ,  pj,  so  T  is  a 
subgraph  of  T  -  ( Pp  P  ) . 

COROLIiARY.  Let  T7  ■  (47,t)  be  the  dual  of  the  production  graph  T  ■  Let 

To  *  (*q,u)  t)e  an  initial  subgraph  of  T,  let  «=  (4^, a)  be  the  dual  of  T^. 

Then  T#  -  is  an  initial  subgraph  of  T7  with  terminal  string  a. 

PROPOSITION  2.1.15.  Let  T  ®  (4,p)  be  a  production  graph,  let  4^  C  4. 

T^  =  ( ^1 , *Ai )  is  a  subgraph  of  T  if  and  only  if  the  terminal  string  of  T^  is  a 
6tring  of  T  and  T^  is  a  production  graph. 

Proof.  Suppose  T^  is  a  subgraph  of  T.  Then  is  a  string  of  T,  i.e.,  is  a 
terminal  substring  of  an  initial  subgraph  Tq  of  T.  By  2.1.12,  Tq  +  T^  is  an 
initial  subgraph  of  T.  Therefore  the  terminal  string  of  T^  is  a  string  of  T. 

Conversely,  suppose  the  terminal  string  of  T^  is  a  string  of  T.  Then  the 
dual  of  T^  is  a  subgraph  of  the  dual  of  T,  for,  by  2.1.13  applied  to  the  dual 
of  T  and  the  dual  of  an  initial  subgraph  Tq  of  T,  we  see  that  the  dual  of 
T  -  Tq  is  an  initial  subgraph  of  the  dual  of  T  and  has  the  same  terminal  string 
as  Tq.  Thus  the  strings  of  T  are  just  the  strings  of  the  dual  of  T.  So  the 
initial  string  of  the  dual  of  T^  is  a  string  of  the  dual  of  T.  By  the  first 
part  of  the  assertion  being  proved,  the  terminal  string  of  the  dual  of  T^, 
i.e.,  the  initial  string  of  T^,  is  a  string  of  the  dual  of  T,  hence  a  string 
.  Therefore  T^  is  a  subgraph  of  T. 


of  T 
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COROLLARY.  If  T^  is  a  subgraph  of  T  then  the  dual  of  ^  is  a  subgraph  of  the 
dual  of  T. 

Let  T  -  ($,n)  be  a  production  graph.  Let  T^  be  a  subgraph  of  T.  Let  T 
be  a  production  graph  with  the  same  initial  and  terminal  strings,  respectively,  as 
T. ,  but  with  no  other  nodes  in  common  with  T.  Then  we  say  that  is 
replaceable  by  in  T. 

PROPOSITION  2.1.16.  Let  T  -  ($,u)  be  a  production  graph  with  terminal  string  t. 
Let  T^  ■  be  replaceable  by  T 2  -  ($2,u2)  in  T*  Then  (($  -  U  $2,u) 

is  a  production  graph  with  terminal  string  t. 

Notation.  (($  -  U  $2,u)  will  be  called  the  graph  formed  by  replacing 
b^  T2  in  T.  We  will  use  seme  obvious  grammatical  derivatives  of  this. 

Proof.  Let  Tq  be  an  initial  subgraph  of  T  with  terminal  string  By 

2.1.12  and  2.1.11,  Tq  +  T^  1b  an  initial  subgraph  of  T  with  terminal  string 

By  the  dual  of  2.1.13,  T  -  (Tq  +  T^)  is  a  subgraph  of  T  with  initial 
string  and  with  terminal  string  t.  By  2.1.11,  Tq  +  T2  is  a  production 

graph  with  initial  string  u  and  terminal  string  We  want  to  form 

(Tq  +  T2)  +  (T  -  (T  +  T^)).  We  can  do  this  by  2.1.11, provided  that  the  two 
graphs  have  no  nodes  in  conmon  other  than  nodes  of  u  T, u*.  By  2.1.12,  T  +  T. 

O  X  O  O  X 

has,  except  for  nodes  of  no  nodes  in  common  with  T  -  (Tq  +  T^).  Except 

for  nodes  of  and  T^,  T,  has  no  nodes  in  common  with  T,  hence  none  in  common 
with  T  -  (Tq  +  T^).  So  if  Tq  4  Tg  has  a  nonterminal  node  p  belonging  also  to 
T  -  (T^  +  T^),  then  p  must  be  in  ^  and  not  in  t^.  Therefore,  by  2.1.1  applied 
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to  T^,  p  must  occur  in  for  some  production  (p^,p2)  of  T^.  But  then  p  is,  by 
2.1.1,  a  nonterminal  node  of  Tq  +  and  hence  cannot  belong  to  T  -  (T  +  T^). 

So  the  two  graphs  have  the  required  property  and  their  sum  can  be  formed. 

All  additions  are  of  the  type  that  preserves  the  initial  string  of  the 
term  on  the  left,  so  (Tq  +  T2)  +  (T  -  (T  +  T^))  has  initial  string  tt.  Letting 
denote  the  set  of  productions  of  Tq,  the  productions  of  (T  +  T^)  + 

(T  -  (T  +  T^))  are 

(iQ  U  $2)  'o  (4  -  (*Q  U  4X))  -  (iQ  U  *2)  U  (4  -  4X) 

-  «2  U  («  -  i±). 

The  last  equation  holds  because,  if  4^  ft  4^  were  not  empty,  then  some  production 
(p^,  P2)  would  be  common  to  Tq  and  T^,  whence  the  nodes  of  p2  would  be  noninitial 
in  Tp  yet  belong  to  Tq,  and  we  agreed  that  this  does  not  happen. 

(Tq  +  T2)  +  (T  -  (Tq  +  T^))  thus  has  the  required  initial  word  and  the 
required  set  of  productions. 

PROPOSITION  2.1.17.  Let  (4^, 4)  and  (4^,4)  be  initial  subgraphs  of  the 
production  graph  (4,4).  Then  ( 4^  H  4^4)  is  an  initial  subgraph  of  (4,4). 
Moreover,  if  a  is  a  terminal  substring  of  4^,  and  if  all  the  nodes  of  o  occur 
in  productions  of  4^  then  a  is  a  terminal  substring  of  (4^  H  42,u). 

Notation.  (4^  fl  42,u)  will  be  denoted  by  (4^,4)  H  (4^,^). 

Proof.  By  induction  on  the  number  of  productions  of  4^.  If  4^  is  empty,  then 
(4^  H  40,n)  «  (4^,u),  an  initial  subgraph  of  (4,4).  Moreover,  the  terminal 
string  of  (4^  D  42,ia)  is  in  this  case  4,  so  that  any  terminal  substring  is 


preserved  in  the  intersection. 
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Suppose  the  assertion  holds  in  all  cases  where  4^  has  n  productions,  and 
consider  a  case  where  4^  has  n  +  1  productions.  (4^,n)  =  (4./,n)  +  (p^  p,) 
for  some  terminal  production  (p^,p2)  of  (4^4). 

Suppose  (p1,  p2)  is  not  in  4g.  Then  ^  0  ^  .  4^  fl  $0,  whence  fl  $  > M- ) 
is  an  initial  subgraph  by  inductive  hypothesis.  Moreover,  suppose  a  is  a 
terminal  substring  of  (4^,u)  such  that  all  nodes  of  a  are  in  ($2,u)»  Now  a  is 
disjoint  from  p2,  for,  by  the  dual  of  2.1.1, no  node  of  p2  1g  in  4,  hence  any 
node  of  P2  in  (42,u)  is  a  noninitial  node  of  ($2,u),  hence  occurs  in  the 
second  string  of  some  production  of  u),  and,  by  the  dual  of  2.1.2, this  could 
only  be  the  production  (p1,p2).  But  we  Eire  assuming  that  (p  ,  p2)  is  not  in 
($2,u),  so  we  conclude  that  o  is  disjoint  from  P2»  It  follows  that  a  is  a 
terminal  substring  of  (4^, a)  and  hence,  by  inductive  hypothesis,  a  terminal 
substring  of  (4^  (1  42,u). 

Suppose  (p1, P2)  is  in  4g,  p1  is  a  terminal  s'  tring  of  (4^,4),  and  all 
the  nodes  of  p.^  are  in  (42,a).  By  inductive  hypothesis  (4^  0  42,u)  is  an 
Initial  subgraph  with  terminal  substring  p^  Hence  (4^  0  4^4)  +  (p^p,)  is 
an  initial  subgraph,  and  its  productions  eu:e  (4^  0  4,,)  U  {(p^,p2)}  =  4^  0  42» 
Thus  (4^  D  42,u)  is  an  initial  subgraph. 

Moreover,  suppose  o  is  a  terminal  substring  of  (4^,u)  such  that  every 
node  of  a  is  in  (42,n).  If  a  is  disjoint  from  p2  then  (4^,4)  conteiins  a  as  a 
terminal  substring  and  a  is  disjoint  from  p^.  By  inductive  hypothesis 
(4^  0  4^,4)  contains  both  a  and  p^  as  terminal  substrings,  a  and  p^  being 
disjoint,  (4^  0  42,ix)  +  (p^,p2)  ■  (4^  D  4^,4)  contains  a  as  a  terminal  substring. 
If  o  is  not  disjoint  from  p^, then  there  exists  a  terminal  substring  o^  PgOg 
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of  containing  ct  as  substring,  and  where  every  node  of  Qj.P2'72  *s  *n 

(v  ,u).  c]_0]_a2  is  a  terminal  substring  of  hence  of  D  $.-,u)  by 

inductive  hypothesis.  fT2.P2a2  is  tlierefore  Q  terminal  substring  of 

°  *2>'1)  +  (°i>p2^  c  ^  *2’ u^*  There^ore  a  is  a  terminal  substring 
of  (*,  n  $2,u). 

PROPOSITION  2.1.16.  For  each  production  (o,  t)  of  a  proauction  graph  T,  a  and  t 
are  strings  of  T. 

Proof.  By  induction  on  the  number  of  productions  in  T.  The  assertion  Is 
vacuously  true  when  T  has  no  productions.  Suppose  the  assertion  is  true  in 
all  cases  where  T  has  n  productions,  and  consider  a  case  where  T  has  n  +  1 
productions.  Then  T  ■  T  *  +  (a#,  t#)  where  T*  has  n  productions.  If  (o,  t)  is 
in  T*  then  o  and  r  are  strings  of  T#  by  inductive  hypothesis,  hence  strings 
of  T.  Otherwise  (a,  t)  •  (o',  t').  a'  is  a  string  of  T  because  It  is  a 
terminal  substring  of  the  initial  subgraph  T‘  of  T.  t  is  a  string  of  T 
because  it  is  a  terminal  substring  of  T. 

COROLLARY.  Every  node  of  T  1b  a  string  of  T. 

Proof.  Every  initial  node  is  in  the  initial  subgraph  (0,u);  every  terminal 
node  is  terminal  in  the  initial  subgraph  T.  All  other  nodes  occur  in 
productions,  by  2.1.1  and  its  dual,  hence  are  strings  of  T  by  2.1.18. 

PROPOSITION  2.1.19*  bet  Tq  be  an  initial  subgraph  of  a  production  graph  T  and 

let  T^  be  a  subgraph  of  T.  If  Tq  contains  all  the  noninitial  terminal  nodes  of 

T, .  then  T  containo  all  the  productions  of  T, . 

1/  O  ± 
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Proof.  By  induction  on  the  number  of  productions  in  T^.  If  has  no  produc¬ 
tions,  then  we  have  nothing  to  prove.  Let  have  n  +  1  productions  where  the 
assertion  holds  for  all  subgraphs  having  n  productions.  Let  +  (a, t). 

The  nodes  of  r  are  terminal  and  noninitial  in  T,  and  therefore  are  all  in  T  . 

1  o 

By  2.1.7,  (a, t)  is  in  T^.  Thus  the  nodes  of  o  are  in  Tq.  A  noninitial 
terminal  node  of  T^  is  either  in  o  or  is  a  noninitial  terminal  node  of  T^.  So 
by  inductive  hypothesis  Tq  contains  all  the  productions  of  T^.  Since  it  also 
contains  (o, t),  it  contains  all  the  productions  of  T^. 

PROPOSITION  2.1.20.  Let  Tq  be  an  initial  subgraph  of  a  production  graph  T, 
let  T^  be  a  subgraph  of  T.  If  Tq  contains  till  the  terminal  nodes  of  Tp  then  T^ 
is  a  subgraph  of  Tq. 

Proof.  We  only  have  to  show  that  the  terminal  string  t  of  T^  is  a  string  of  Tq. 
There  exists  an  initial  subgraph  U  of  T  containing  t  as  terminal  substring. 

By  2.1.17,  U  0  Tq  is  an  initial  subgraph  of  T  containing  t  as  terminal  sub¬ 
string.  U  H  Tq  is  also  an  initial  subgraph  of  Tq,  therefore  t  is  a  string  of 

V 

COROLLARY.  (By  duality.)  If  T  -  Tq  contains  all  the  initial  nodes  of  Tp  then 
T^  is  a  subgraph  of  T  -  Tq. 

COROLLARY.  (By  duality.)  If  Tq  fails  to  contain  any  of  the  nonterminal  initial 
nodes  of  Tp  then  T^  fails  to  contain  any  production  of  T^. 

While  2.1.16  allows  us  to  replace  a  single  subgraph,  we  will  need  to  be 
able  to  replace  more  than  one  subgraph  at  the  same  time.  It  can  happen  that 


16  January  1967 


29 


TM-73U/030/0G 


after  the  replacement  of  one  subgraph  the  next  one  is  no  longer  present  as  a 
subgraph,  even  though  all  of  it6  nodes  and  productions  are  still  there.  We 
next  set  forth  some  conditions  under  which  such  difficulties  do  not  occur. 


DEFINITION.  Two  subgraphs  of  a  production  graph  T  are  se parable  in  T  if  there 
exists  on  initial  subgraph  Tq  of  T  which  contains  all  the  nodes  of  one  while 

T  -  T  contains  all  the  nodes  of  the  other. 

o 

DEFINITION.  Let  T^  and  T 2  be  subgraphs  of  a  production  graph  T.  Let  and  U 2 
be  production  graphs.  T^,T2  are  replaceable  by  UpU2  in  T  if  T^  and  T2  are 
separable,  T^  is  replaceable  by  in  T,  T2  is  replaceable  by  U2  in  T,  and 
and  U2  have  do  common  nodes  other  than  their  initial  or  terminal  nodes. 

Ti,  ...,Tk,  all  subgraphs  of  T,  are  replaceable  by  l^, ...,Uk  in  T  if  for  all 
i  ^  J  it  is  true  that  T^Tj  are  replaceable  by  U^Uj  in  T. 

PROPOSITION  2.1.21.  Let  T^  and  T?  be  subgraphs  of  a  production  graph  T.  Let 
Ti,T2  he  replaceable  by  UpUg  in  T.  Let  V  be  formed  by  replacing  by 
in  T.  Then  T2  is  a  subgraph  of  V  and  T^  is  replaceable  by  U2  in  V. 


Notation.  The  graph  formed  by  replacing  T0  by  U2  in  V  will  be  referred  to  as 
the  graph  formed  by  replacing  T^,T 2  b^  U^,U2  —  T*  Moreover,  if  T^, . . .,Tk  are 
replaceable  by  U^,  ...,Uk  in  T,  then  repeated  use  of  2.1.21  allows  us  to  speak, 
with  the  obvious  meaning,  of  the  graph  formed  by  replacing  T^,  ...,Tk  by 
U^,  ...,Uk  in  T.  Clearly,  if  tt  is  any  permutation  on  the  first  k  integers, then, 
in  view  of  2.1.21,  the  graph  formed  by  replacing  T^q  y  •  .  with 

Un(i),  •  •  •jh’n(k)  in  T  is  the  some  as  the  graph  formed  by  replacing  T^,  ...,Tk 


by  1  ,  •  •  • ,  Uk  in  i.  • 
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Proof.  Case  1.  Suppose  there  exists  an  initial  subgraph  Tq  of  T  containing 
all  the  nodes  of  T,  while  T  -  T  contains  all  the  nodes  of  T„.  By  2.1.20  and 
its  dual,  is  a  subgraph  of  Tq  and  is  a  subgraph  of  T  -  Tq.  is 
replaceable  by  in  Tq.  Let  UQ  be  the  graph  formed  by  replacing  by  in 
Tq.  By  merely  checking  productions  and  initial  strings  we  see  that  Uq  +  (T  -  Tq) 
is  the  graph  V  formed  by  replacing  by  in  T.  T2  is  a  subgraph  of  V, 
because  it  is  a  subgraph  of  -  Tq.  U2  has  no  nodes  in  common  with  or  with 
T  except  for  initial  or  terminal  nodes  of  U^,  hence  no  nodes  in  common  with 
U  or  T  -  T  except  for  initial  or  terminal  nodes  of  U_.  U0  still  has  the 
same  initial  and  terminal  strings  as  T,,.  Therefore  is  replaceable  by  U0  in  V. 

Case  2.  There  is  an  initial  subgraph  Tq  of  T  which  contains  all  the 
nodes  of  T2  while  T  -  Tq  contains  all  the  nodes  of  T^»  Looking  at  the  duals 
of  these  graphs  we  see  Case  1  again,  and  conclude  that  the  dual  of  T2  is 
replaceable  by  the  dual  of  U,  in  the  dual  of  V.  Therefore  T^,  is  replaceable 
by  U2  in  V. 

PROPOSITION  2.1.22.  Let  T^  and  T2  be  two  subgraphs  with  no  productions  in 
common.  Suppose  neither  subgraph  has  a  node  which  is  both  initial  and 
terminal  in  the  subgraph.  If  each  of  T^,T2  has  Just  oie  initial  production, 
or  if  each  has  Just  one  terminal  production,  then  T^  tad  T2  are  separable. 

Proof.  Suppose  each  subgraph  has  Just  one  initial  production.  Let  the 
respective  initial  productions  be  (a^,T^)  and  (ct2,t2).  By  2.1.18,  there 
exists  an  initial  subgraph  Tq  containing  as  a  terminal  substring.  Suppose  T^ 
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contains  a  node  of  r^.  Then  by  2.1.7,  Tq  contains  Since  ^ 

(ri2,r^),  (^2,  t2)  is  in  the  initial  subgraph  Tq  -  (0^,1^).  By  2.1.18,  there  must 

exist  an  initial  subgraph  of  Tq  containing  t 2  as  terminal  substring.  No 

node  of  is  in  Tq.  is  an  initial  subgraph  of  the  main  graph,  which  we  now 
name  T.  Thus  by  a  change  of  notation  if  necessary  we  can  assume  that  is  a 
terminal  subword  of  an  initial  subgraph  Tq  which  contains  no  nodes  of  t|( 

No  node  of  is  an  initial  node  of  T  -  for  the  initial  nodes  of 
T  -  T  are  the  terminal  nodes  of  T  .  Hence,  by  2.1.1  and  2.1.2,  (o„,  t_)  is  in 

O  O  *  7  2 

T  -  Tq.  By  2.1.19,  since  c 2  must  be  the  initial  string  of  T2,  T  -  Tq  contains 
all  the  nodes  of  T,,. 

is  a  terminal  substring  of  Tq.  is  the  initial  string  of  Tp  hence 
t1  is  the  initial  string  of  ^  -  (opi^).  So  Tq  +  (T^  -  (op*^))  -  U  is  an 
initial  subgraph  of  T.  U  contains  no  production  of  T2,  for  Tq  does  not*  and 

only  productions  of  T^  art  added  to  form  U.  T-U  therefore  contains  all  the  nodeB 

of  T2,  because  every  node  of  T2  is  in  a  production  of  T2  lest  some  node  of  Tj 
be  both  initial  and  terminal,  so  that  should  T-U  lack  a  node  of  T2  it  must  also 
lack  a  production  of  T2»  U  contains  all  the  nodeB  of  for  the  same  kind 
of  reason.  Therefore  and  Tg  are  separable. 

2.2.  Labeled  production  graphs  and  ambiguity. 

DEFINITION.  A  labeled  production  graph  is  a  production  graph  in  which  each 
node  is  an  ordered  pair  (p,X),  where  p  is  the  node  name  and  X  is  the  node  label. 
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Notation.  It  will  seldom  be  necessary  to  make  explicit  reference  to  the  node 
name.  We  will  continue  to  use  p,q,r, ...  as  symbols  for  nodes. 

For  a  node  p,  cp(p)  will  denote  the  label  of  p. 

For  a  string  a  ■  P^***Pk>  the  3  abel  of  a,  written  0(0),  is  ro(p^) . .  .ro(pk) . 

For  a  production  (a, t)  of  a  labeled  production  graph,  the  label  of  (a,  r), 
written  cp((a,T)),  i o  a  production  <p(a)  -*  ro(r). 

Let  T  be  a  labeled  production  graph  with  initial  string  u  and  terminal 
string  t.  Then  rs(u)  is  called  the  initial  word  of  T,  and  co(t)  is  called  the 
terminal  word  of  T.  Continuing  in  the  same  vein,  an  initial  (terminal)  sub  word 
of  T  is  a  subword  of  the  initial  (terminal)  word  of  T,  an  intermediate  word 
of  T  is  the  label  of  an  intermediate  string  of  T,  a  ward  of  T  is  the  label  of 
a  string  of  T. 

DEFINITION.  Let  G  be  a  grammar.  A  labeled  production  graph  T  is  a  G- graph  if, 
for  every  production  (cr,  t)  of  T,  ro(a)  -•  co(t)  is  a  production  of  G. 

If  G  is  a  grammar  with  start  letter  S  and  external  alphabet  E,then  a 
complete  G- graph  is  a  G- graph  with  initial  wo.  u  S  and  terminal  ’»ord  1~  E*. 

Let  G  be  a  grammar.  Let  us  show  that  »  k?  0  if  and  only  If  thor exists 

U 

a  G- graph  with  initial  word  a  and  terminal  word  B. 

In  one  direction  we  proceed  by  induction  on  the  length  of  the  derivation. 
For  a  derivation  of  length  1,  the  required  G-graph  is  merely  a  sequence  of 
labeled  nodes  with  no  productions.  Suppose  that  for  all  derivations  or^,  ...,or 
of  length  n  there  is  a  corresponding  G-graph  with  initial  word  and  terminal 
word  0^  and  consider  a  derivation  <2^,  •  *  •'0n+i*  °n  bi6263'  Vi  ‘  biy93' 
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where  9„  -*  y  is  a  production  of  0.  There  is  a  graph  T  with  initial  string  a, 
and  terminal  string  T]_T2T3' sucb  that  ro(o)  «  o^,  cr^T.,)  ■  B^,  «-n( t2)  ■  B(>, 

C*10086  a  string  p  of  new  nodes  so  thau  cp(  p)  «  y.  T  +  (t,,  p)  has 
terminal  string  t^pt^,  and  terminal  ward  co{ *f^)rp( p)'o(t^)  ■  c^yB^  ■  and  the 

initial  word  o_  . 

In  the  other  direction  we  proceed  by  induction  on  the  number  of  productions 

in  the  G-graph.  If  the  G-graph  has  no  productions  then  it  corresponds  in  the 

required  way  with  a  one-word  derivation.  Suppose  a  G-graph  T  has  initial  word 

ry  and  terminal  word  B  and  that  or  I—  B.  Suppose  that  T  has  terminal  string 

G 

V2t3,  and  that  T  +  (t2,p)  is  a  G-graph.  The  terminal  word  of  T  +  (t2,  p)  is 

fD('1)'0(p)tp(T3). 

Clearly  B  -  pjrp^),  vrticnce  n  Iq  ®( p)'^ T^) . 

Notice  that  the  construction  of  the  G-graph  from  a  given  derivation,  a6 
set  forth  in  the  previous  proof,  is  unique  except  for  the  choice  of  node  names, 
given  for  each  i  the  exact  production  which  sends  to  .  The  derivation 
itself  does  not  necessarily  provide  this  information,  but  it  does  in  the  case 
of  "leftmost"  derivations  of  context  free  grammars,  which  we  will  be  considering 
shortly. 

Two  graphs  that  are  the  same  except  for  node  names  will  be  called 
congruent.  Formally: 

DEFINITION.  Two  labeled  graphs,  ( p^)  and  (52,p2),  are  congruent  if  there 
exists  a  one-to-one  function  h  from  the  nodes  of  ($^,p^)  onto  the  nodes  of 
($2,P2)  such  that: 
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(i)  u2  =  h(^); 

(ii)  (a, t)  is  a  production  of  if  and  only  if  (h(a),h(r))  is  a 
production  of  $2; 

(iii)  co(p)  ■  cp(h(p))  for  every  node  p  of  ($^,0^). 

3y  h(p^...pk)  we  mean,  of  course,  h(p^) . . .h(pk) . 

If  P  is  a  production  of  P  *»  (a, t),  then  by  h(P)  we  mea.i  the  production 
(h(a),h(T))  of  If  $ ;  is  a  subset  of  4>  then  by  h(,J/)  we  meeui  the  subset 

fh(P)  :  P  in  $  M  of  «  . 

It  is  easily  seen  that  if  ($',a)  is  a  subgraph  (initial  subgraph)  of 
( #1, U>i),  then  h((?  /Jc))  *  (h(4/),h(o))  is  a  subgraph  (initial  subgraph)  of 

( ‘*2* ^2^ * 

If  the  terminal  string  of  (4^,u^)  is  rf  then  h(i)  is  the  terminal  string 
of  ($2,u2).  Conversely,  we  can  conclude  that  the  two  graphs  are  congruent  if 
they  satisfy  (ii)  and  (iii),  and  have  respective  terminal  strings  and  Tg 
such  that  h(T1)  ■  Tg.  The  latter  assertion  follows  from  the  duality  principle. 

PROPOSITION  2.2.1.  Condition  (ii)  in  the  definition  of  congruence  can  be 
weakened  to: 

(ii  )  If  (o,t)  is  a  production  of  then  (h(c),h(T))  is  a  production 
of  $2- 

Proof.  Suppose  two  labeled  graphs  T^  and  Tg  satisfy  (i),  (ll'),  and  (iii) 
with  respect  to  the  one-to-one  mapping  h  from  the  nodes  of  T^  onto  the  nodes 
of  Tg.  Suppose  (h(a),h(T))  is  a  production  of  Tg  while  (a, t)  is  not  a 
production  of  T^.  No  node  of  t  could  be  an  initial  node  of  T^  because  by  (i)  p 


16  January  1967 


35 


TM-738/030/00 


is  an  initial  node  only  if  h(p)  is,  but,  by  the  dual  of  2.1.2,  no  node  o"  ) 
is  an  initial  node  of  T 2*  Therefore,  by  the  dual  of  2.1.2,  for  any  node  p  of  t 

there  is  a  production  (a',r')  of  where  p  is  in  t/.  (h(a,),h(T))  is  by  (ii ;) 

a  production  of  T:),  and  h(p)  is  in  h(T/).  (o',  r')  i  (c,  r),  and  since  h  is 

one-to-one,  this  entails  (h(c,),h(T/))  ^  (h(a),h(T)).  But  then  h(p)  occurs  in 
the  second  string  of  two  productions  of  T2,  contrary  to  the  dual  of  2.1.2.  This 
contradiction  results  from  assuming  that  (ii)  fails  to  hold. 

PROPOSITION  2.2.2.  There  is  at  most  one  congruence  between  two  given  labeled 
production  graphs. 

Proof.  Suppose  h^  and  h 2  are  congruences  from  T^  to  T^.  Let  and  u2  be  the 

initial  strings  of  T^  and  T2  respectively.  Then  h^^)  =  1,e*>  the  two 

agree  on  the  initial  nodes.  In  other  words,  h^  and  h^  agree  on  the  nodes  of 
the  initial  subgraph  of  T^,  which  contains  no  productions.  Suppose  they  agree 
on  the  nodes  of  any  Initial  subgraph  of  T^  containing  n  productions,  and  let 
Tq  +  (ct,t)  be  an  initial  subgraph  of  T^  containing  n  +  1  productions.  Then 
h^(a)  -  hgCcr), because  a  is  a  string  of  Tq  and  Tq  only  has  n  productions. 
(h^(a),h^(T) )  and  (h2(o),h2(T))  are  productions  of  T  .  Lest  the  nodes  of 
h^(o)  ■  h  (c)  appear  in  the  first  strings  of  two  distinct  productions,  we  must 
have  ^(t)  «  h2(*r).  Hence  h^  and  h2  agree  on  all  the  nodes  of  TQ  ♦  (a, t). 

This  extends  the  assertion  to  all  initial  subgraphs  of  T^,  one  of  which  is 
itself. 

PROPOSITION  2.2.3.  Let  T  and  T*  be  congruent.  I/it  T^,  ...,T^  be  replaceable  by 
U^,...,Uk  in  T,  and  suppose  the  congruence  h  fran  T  to  T7  is  extended  to  the 
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nodes  of  the  so  that  h  is  a  congruence  from  to  h^)  for  i  «  1, ...,k, 
and  so  that  h^), . . .,h(Tk)  are  replaceable  by  h^), . . .,h(Uk)  in  t'.  Then 
the  graph  formed  by  replacing  T^  . ..,Tk  by  1^,  ...,Uk  in  T  is  congruent  to  the 
graph  formed  by  replacing  hCT^) , . . .,h(Tk)  by  h^), . .  .,h(UK)  in  T7. 

Proof,  h  is  obviously  an  onto  mapping.  Let  us  see  whether  h  is  one-to-one. 

If  p  and.  q  fire  both  nodes  of  T,  then  h(p)  ^  h(q)  unless  p  =  q,  for  h  is  a 
congruence  from  T.  If  q  is  not  a  node  of  T,  then  q  is  a  noninitial  nonterminal 
node  of  one  of  the  U^.  Hence  h(q)  is  a  noninitial  nonterminal  node  of  h(Ui). 
Because  of  the  hypothesis  about  replaceability  in  T/ ,  h(q)  can  belong  neither 
to  T7  nor  to  any  h(Uj)  other  than  h(U1).  Therefore  if  h(p)  ■  h(q),  p  must  not 
belong  either  to  T  or  to  any  Uj  cither  than  U^.  p  has  to  belong  to  something, 
so  p  must  belong  to  U^.  Since  h  is  a  congruence  on  U^,  it  follows  that  p  ■  q. 
Therefore  h  is  one-to-one. 

If  u  is  the  initial  string  of  T,then  h(u)  is  the  initial  string  of  T7, 
and  the  replacements  do  not  change  the  respective  initial  strings.  Thus  (i) 
is  satisfied. 

Let  (ctjt)  be  a  production  of  the  graph  V  formed  by  the  replacement  of 
T-^,  ...,Tk  by  in  T.  If  (ct,t)  is  in  T,  then  (o,  t)  is  in  none  of  the  T^ 

because  all  of  the  productions  of  the  T^  have  been  replaced.  Then  (h(a),h(T)) 
is  in  T 7  without  being  in  any  of  the  h(T^).  Any  production  of  T7  not  in  any 
of  the  h(T^)  must  still  be  there  after  the  replacement  is  made,  so  (h(a),h(T)) 
is  in  the  replacement  graph.  If  (o, t)  is  not  in  T,  then  it  is  in  sane  U^, 
whence  (h(a),h(T))  is  in  h(U^).  This  again  requires  (h(a),h(T))  to  be  in  the 
replacement  graph.  Thus  the  assertion  (ii 7)  of  2,2.1  holds. 
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(iii)  holds  for  any  node  p  either  because  p  is  in  T  or  because  p  Is  jne 
of  the  U^. 

Since  we  have  (i),  (ii*),  and  (iii),  we  have  congruence  by  2.2.1. 

DEFINITION.  A  grammar  G  is  unambiguous  if  every  two  complete  G- graphs  having 
the  some  terminal  word  are  congruent. 

Let  us  make  sure  that  this  notion  of  unambiguity  reduces  in  the  context 
free  case  to  the  usual  notion. 

The  usual  notion  has  been  expressed  in  terms  of  trees  which  are  easily 
seen  to  be  just  the  context  free  case  of  our  production  graphs.  However,  the 
more  familiar  definition  is  in  terms  of  leftmost  generations  [2]. 

Let  G  be  a  context  free  grammar.  A  derivation  o^,...,(yn  is  leftmost  if 
for  each  i  <  n  we  have  *  ®iXi^i  811(1  where  Xi  “*  7^  is  a 

production  of  G  and  8^  contains  only  external  letters.  A  context  free  grammar 
is  unambiguous  if  for  every  word  w  which  contains  only  external  letters  there 
exists  at  most  one  complete  leftmost  G-derivation  ending  with  w. 

To  show  that  the  two  definitions  of  ambiguity  are  equivalent,  we  will 
establish  a  one-to-one  correspondence  between  complete  G- graphs  and  complete 
leftmost  G-derivations .  Given  a  complete  G- graph  T  with  initial  word  S  and 
terminal  ward  we  will  define  a  unique  leftmost  derivation  F(t)  ■  Q^,  ...,0^ 
where  0^  ■  S,  together  with  a  corresponding  sequence  F*(T)  -  T.^,  ...,Tn  of 
initial  subgraphs  of  T  such  that  is  the  terminal  ward  of  each  T^  and  Tq  ■  T 

We  will  show  that  if  F(t)  »  F(T7),  then  T  is  congruent  to  l'. 

Let  ■  (0,p)  if  p  is  the  initial  string  (necessarily  of  length  l)  of  T. 

■  S  is  the  terminal  word  of  T^.  Suppose  we  have  the  initial  subgraphs 
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T^,  ...,T,  and  a  leftmost  (partial)  derivation  where  each  ry^  is  the 

terminal  word  of  T\.  If  T,  ■  T,  then  we  are  finished.  Otherwise,  let  the 

terminal  string  of  Tj  be  t^t,  where  rp(T^)  contains  only  external  letters  arvi 

co(t.j)  begins  with  an  internal  letter.  The  terminal  string  of  must  be  in 

that  form  because  there  must  be  a  production  of  T  applicable  to  its  terminal 

string,  and  we  have  adopted  the  convention  that  context  free  grammars  do  not 

have  productions  with  external  letters  on  the  left.  Let  t  *  pr  '  where  p  is  a 

node  and  cp(p)  •  X.  Then  ■»  cp( )X'd( t^)  •  p  is  not  a  terminal  node  of  T. 

Therefore  T  has  a  production  (p,  p) .  Let  T^+1  -  Tj  +  (p,  5).  Let 

-  cp(  T^)cp(  p)cp(  t^)  ,  aj+1  is  the  terminal  word  of  Tj+1,  and  if  o^,  ...,oe 

is  a  leftmost  derivation,  then  so  is  o^, . . . ,  Eventually  we  must  arrive 

at  Tn  «*  T  and  a  complete  leftmost  derivation  a^, , . . , o^.  This  completes  the 

construction  of  F  and  F ' ,  That  F(T)  ■  F(T*)  implies  T  congruent  to  T*  can  be 

seen  from  the  construction  Just  set  forth.  Obviously  T  and  t'  would  have  to 

have  the  same  number  n  of  productions.  Let  F/(T/)  »  T^,  . ..,Tj.  Clearly  is 

congruent  to  T^.  Suppose  Tj  is  congruent  to  Tj,  under  the  congruence  h.  If 

J  *=  n  we  have  the  assertion.  If  not,  then,  using  the  notation  of  the 

construction,  the  terminal  string  of  Tj  is  h(T^)h(p)h(-T\p .  cp(h( ) )  contains 

only  external  letters  and  co(h(p))  -  X.  Clearly  Tj+1  -  Tj  +  (h(p),p/)  for  same 

p\  o'  has  to  have  the  same  length  as  p  since  the  length  of  is  fixed. 

®(p/)  ■  ®(p)  in  order  to  obtain  Oj+1  as  terminal  word  of  Tj+^.  The  nodes  of  p 

and  p/  are  foreign  to  T.  and  T*  respectively,  so  there  is  no  difficulty  about 

J  J 

extending  h  so  that  h(p)  -  p#.  Clearly  the  extended  h  is  a  congruence  from 
Tj.i  t0  Tj.i* 
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We  also  need  to  know  the  converse,  that  if  T  Is  congruent  to  T  then 

F(T)  »  FCT7)*  It  suffices  to  show  that  F '(T)  and  F#(T#)  are  congruent  term 

by  term,  for  then  their  sequence  of  terminal  words  must  be  identical.  T^  and 

consist  Just  of  initial  strings  with  the  same  label  and  are  therefore 

congruent.  Suppose  T,  is  congruent  to  Tj.  These  are  initial  subgraphs  of  T 

and  T*,  hence,  there  being  at  most  one  possible  congruence  between  labeled 

graphs,  the  congruence  between  them  must  be  a  restriction  of  the  congruence  h 

from  T  to  T(.  T.  .  =  T.  +  (p,p)  where  p  is  the  leftmost  terminal  node  of  T. 

6uch  that  cp(p)  is  not  an  external  letter.  h(p)  is,  because  of  congruence,  the 

leftmost  terminal  node  of  T*  with  label  other  than  an  external  letter.  In 

J 

order  that  the  sequence  of  terminal  words  be  a  leftmost  derivation, 

Tj+1  ■  Tj  +  (h(p),p')  for  some  p  (h(p),h(p))  is  a  production  of  T#,  and  h(p) 
cannot  be  in  the  first  string  of  two  distinct  productions,  so  p'  *  h(p).  This 
clearly  entails  that  Tj+1  be  congruent  to 

It  is  clear  that,  given  any  leftmost  derivation  we  can  get  a  O-graph  T  such 
that  F(t)  is  that  leftmost  derivation.  This  is  clear  by  inspection  of  the 
general  construction  given  earlier  for  obtaining  G- graphs  from  G-derivations. 


It  follows  now  that  if  there  are  two  different  leftmost  G-derivations  of 


the  same  word,  then  there  are  two  complete,  incongruent  G-graphs  with  the  same 
terminal  ward.  That  is,  if  G  is  ambiguous  in  the  leftmost  derivation  sense, 
then  it  is  ambiguous  in  the  production  graph  sense.  Conversely,  two  incongruent 
complete  G-graphs  with  the  same  terminal  ward  must  give  rise  via  F  to  two 
distinct  leftmost  derivations  of  that  word.  That  is,  if  G  is  ambiguous  in  the 
production  graph  sense,  then  it  is  ambiguous  in  the  leftmost  derivation  sense. 
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Thus  we  have 

PROPOSITION  2.2.4.  A  context  free  gr armor  is  unambiguous  in  the  cense  of 
production  graphs  if  and  only  if  it  is  unambiguous  in  the  sense  of  leftmost 
derivations. 

It  can  be  shown  also  that  a  context  sensitive  grammar  is  unambiguous  in 
the  sense  of  production  graphs  if  and  only  if  it  is  unambiguous  in  the  sense 
of  linear  bounded  automata,  i.e.,  if  and  only  if  the  language  of  the  grammar 
is  the  language  accepted  by  some  linear  bounded  automaton  which  has  at  most 
one  accepting  computation  for  each  word. 
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3.  Context  Limited.  Grammars 


Let  us  now  formulate,  exactly,  a  definition  of  the  context  limited 
grammars,  and  then  prove  that  they  are  equivalent,  as  described  at  the 
beginning,  to  the  context  free  grammars. 


Notation.  Lower  than  or  equal  to  will  denote  a  partial  ordering  relation,  and 
lower  than  will  mean  lower  than  or  equal  to  and  not  equal  to. 


DEFINITION.  A  grammar  G  is  context  limited  if  there  exists  a  partial  ordering 
lower  than  or  equal  to  on  the  alphabet  of  G  such  that  for  each  production 
o  -  S  of  G,  every  letter  of  a  is  lower  than  some  letter  of  0. 

Note,  for  example,  that  if  is  lower  than  Y^,  X^  is  lower  than  Y, ,  and 
is  lower  than  Y^,  then  X^gX^  “*  *3*2*1  cou^  be  Q  production.  XX  -*  X  could 
not  be  a  production. 

If  G  is  a  context  limited  grammar  then  L(G)  is  called  a  context  limited 
language.  Okie  is  for  present  convenience  only,  since  we  will  show  that  they 
are  all  context  free. 

Note  that,  as  we  have  defined  it,  a  context  limited  language  cannot  contain 
the  mall  word  c.  We  could  incorporate  c  with  a  alight  change  in  the  definition, 
but  instead,  since  it  is  generally  agreed  that  the  presence  or  absence  of  e  is 
of  no  fundamental  importance,  let  us  agree  thv*  the  term  "language"  means  a 
set  without  e. 

The  context  limited  grammars  are  generalizations  of  the  grammars, 
introduced  by  Ginsburg  and  Greibach  [4],  such  that  every  production  has  an 
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external  letter  on  the  right.  These  gr armors  were  proved  in  [  4]  to  generate  context 
free  languages .  Also,  Ginsburg  and  Spanier  have  outlined  an  alternate  proof 
that  the  context  limited  languages  are  context  free,  in  which  proof  the 
Ginsburg- Greibach  result  Just  r»ntior*ed  is  applied.  This  proof  could  probably 
be  generalized  to  incorporate  the  preservation  of  unambiguity,  and  possibly 
an  easier  proof  than  the  present  one  could  be  so  constructed.  Note,  however, 
that  the  present  proof  would  be  quite  short  without  the  ambiguity  considerations. 

Example  1.  The  language  fancbp  :  p  is  the  integer  part  of  is  generated  by 
the  context  limited  grammar  with  productions  o  -  aaX,  XX  -•  b,  a  -»  c,  oX  -  c. 

Example  2.  To  illustrate  the  potential  utility  of  the  theorems  we  are  about 
to  prove,  we  use  them  to  give  a  short  proof  of  the  known  fact  that  if  a 
sequential  transducer  M  [3,  p.9ll  is  one-to-one  on  an  unambiguous  context  free 
language  L,  then  M(l)  is  an  unambiguous  context  free  language  (a  slight 
generalization  of  theorem  1  of  [6],  which  is  for  generalized  sequential 
machines).  We  assume  without  loss  of  generality  that  the  input  and  output 
alphabets  of  M  are  disjoint.  Let  pQ  be  the  start  state  of  M.  Then  pqL  is 
an  unambiguous  context  free  language.  By  Theorem  2  (below),  pQL  is  generated 
by  an  unambiguous  context  limited  grammar  G,  the  alphabet  of  which  we  can 
assume  is  disjoint  from  the  output  alphabet  of  M.  We  simply  adjoin  to  G, 
productions  corresponding  to  the  moves  of  M,  so  that  if  M  can  go  from  po  to  Bq 
in  one  move,  p  and  q  being  states,  we  adjoin  a  production  po*  -*  Bq.  We  also 
adjoin  pa-  -»  B  in  order  to  maxe  it  possible  to  get  rid  of  the  state  symbol  at 
the  end  of  the  derivation:  note  that  if  per  -*  B  is  used  too  soon,  then  a 
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complete  derivation  is  not  possible,  because  there  will  still  exist  internal 
letters  but  no  production  to  handle  them.  The  required  partial  ordering  is 
obtained  by  extending  that  for  G,  so  that  every  letter  of  G  is  lover  than  every 
output  letter  of  M.  Unambiguity  follows  from  unambiguity  of  G  and  the  one-to- 
one  property  of  M. 

The  next  Proposition  is  the  easier  direction  of  the  main  theorems. 

PROPOSITION  3*1*  If  G  is  a  context  free  grammar,  then  there  exists  a  context 
limited  grammar  H  ouch  that  L(H)  *  L(G),  and  if  G  i6  unambiguous  then  so  is  H. 

Proof.  The  main  result  of  [8]  is  that  every  context  free  grammar  reduces, 
with  unambiguity  preserved,  to  one  in  which  all  productions  are  of  the  form 
X  -•  a» 

where  a  is  an  external  letter  and  X,  of  course,  is  not  external.  Under  the 
partial  ordering  that  makes  every  internal  letter  lower  than  every  external 
letter,  such  a  grammar  is  context  limited. 

Now  let  us  reduce  each  context  limited  grammar  to  a  context  free  one.  We 
will  need  a  few  preliminary  lemmas. 

First  let  us  recall  the  well-known  fact  that  any  partial  ordering  can  be 
extended  to  a  linear  ordering. 

PROPOSITION  3.2.  If  G  is  a  context  limited  grammar,  then  G  is  context  limited 
with  respect  to  some  linear  ordering  of  its  alphabet  in  which  every  internal 
letter  is  lower  than  every  external  letter. 
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Proof.  We  first  obtain  a  linear  ordering  of  the  internal  letters  by  extending 
the  given  partial  ordering  to  a  linear  ordering,  and  then  restricting  the 
latter  to  the  internal  latter s.  Then  ve  obtain  a  linear  ordering  of  the 
external  letters  in  the  sane  way.  Then  ve  combine  the  tvo  into  a  single  linear 
ordering  in  which  every  internal  letter  is  lover  than  every  external  letter. 

Nov  there  is  no  need  to  have  the  relation  ”a  lover  than  X"  for  an  external 
letter  a,  for  by  definition  the  external  letters  do  not  occur  on  the  left  in 
productions.  So  we  can  assume  that  the  given  partial  ordering  has  this 
property.  Ihen  the  linear  ordering  that  ve  Just  constructed  1b  an  extension 
of  the  given  partial  ordering,  hence  0  must  be  context  limited  with  respect  to 
it. 

If  G  is  a  grammar,  either  context  limited  or  context  free,  ve  call  G 
unambiguous  modulo  the  set  R  if  whenever  tvo  complete  G- graphs  T^  and  have 
the  same  terminal  word  ^  where  v  is  in  R,  than  is  congruent  to  Tg. 

Wc  could  prcr'e  directly  that  if  a  context  limited  grammar  is  unambiguous 
modulo  a  regular  set  R,  then  its  language  intersected  with  R  is  unambiguous. 

But  this  result  will  follow  an  a  byproduct  of  the  main  result.  We  will,  though, 
need  to  prove  it  in  the  special  case  where  the  granmar  is  context  free. 

PROPOSITION  3*3»  If  a  context  free  grammar  0  is  unambiguous  modulo  a  regular 
set  R,  then  L(G)  0  R  is  an  unambiguous  context  free  language. 

Proof.  Let  A  -  (K,  E,6,qQ,F)  be  a  finite  automaton  (i.e.,  finite-state  acceptor 
[3,  P*^7j)  such  that  R  is  the  set  of  words  accepted  by  A. 

We  define  a  context  free  grammar  H  as  follows.  The  alphabet  of  H  is  the 
external  alphabet  of  G  together  with  the  triples  (pXq),  p  and  q  states  of  A  and 
X  in  the  internal  alphabet  of  G. 
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For  two  states  p  and  q  of  A  and  a  word  O'  on  the  alphabet  of  H,  call  <y 
(p,q)- connected  if  any  of  the  following  hold: 

(i)  a  contains  only  input  letters  of  A  and  6(p,a)  «  q. 

(ii)  Of  =  (pXq)  for  some  X. 

(iii)  There  are  words  o^,...,o^  and  states  p^,...,pk+^  such  that  a  «  o^...^, 
and  for  each  i  *  k  o .is  (p^p  connecte<*  according  to  (i)  and  (ii). 

Let  be  the  initial  letter  of  H,  where  Sp  is  distinct  from  all  letters 


H 


mentioned  so  far.  Let  S  be  the  initial  letter  of  G.  For  each  final  state  p  of 
A,  let  H  have  the  production 


SH  -  <'J0Sp)- 

For  a  word  or  on  the  alphabet  of  H,  let  o  be  the  word  on  the  alphabet  of  G 
obtained  in  the  obvious  way  by  replacing  each  (pXq)  by  X  and  leaving  external 
letters  unchanged. 

For  each  internal  letter  (pXq)  of  H,  and  each  (p,q)-connected  word  B  such 
that  X  -♦  *0  is  a  production  of  G,  let 


(pXq)  -  B 


be  a  production  of  H. 

It  is  obvious  that  if  or  is  (p,q)-connected  and  ot  |-  then  B  is  (p,q)- 

n 

connected. 

Now  if  a  word  w  on  the  external  alphabet  of  A  is  (qQ,p) -connected  where  p 
is  a  final  Btate,  then  w  is  in  R. 

From  these  two  assertions;  L(H)  cR. 

The  productions  of  H  being  homomorphic  to  productions  of  G  via  the 

homomorphism  which  takes  o'  to  a,  except  for  the  productions  involving  S„, 

u 
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it  is  clear  that  if  v  has  only  external  letters  and  (q  Sp)  It:  w  then  S  k  v. 

ALl  complete  H-derivations  are  of  the  form  Su,  (q  Sp),...,w.  Therefore 

n  o 

L(H)  cl(g).  Therefore 

L(H)  c  l(G)  0  R. 

On  the  other  hand,  if  any  word  a  on  the  alphabet  of  H  is  (p,q) -connected, 
and  if  X  t  b,  then  (pXq)  |~  <v.  For,  we  have  either  X  ■  »  or 

u  n 

X  -*G  .  ‘wkYkwk+1  where  the  v4  are  words  on  the  external  alphabet  and  the  Y 

are  internal  letters,  with  Y^  0.^  and  •  *vk^kwk+i  "  a*  P1*0111  this  it  is 

clear  that  we  con  prove  the  assertion  by  irxiuction  on  the  length  of  the 
derivation.  In  the  induction  step,  we  observe  that  if  is  (pi,pi+^)-connected 

then  (P^P^)  I Ba.  This  together  with  (pXq)  -«H  VP1Y1P2)  *  *  *  VpkVk+lK+l 
shows  that  (pXq)  I—  ®. 

New  any  word  w  in  R  0  L(G)  is  (q^pj-connected  for  some  final  state  p. 
Therefore  if  S  |-^  then  (qQSp)  w  =  w.  This  and  (qQSp)  show  that  w 

is  in  L(H) .  Thus 

R  0  L(G)  <=  L(H). 

Therefore 

L(H)  =  L(G)  n  R. 

The  productions  of  H  being  hamomor^ic  in  the  sense  mentioned  above  to 
the  productions  of  G,  every  complete  H- graph  is,  ignoring  the  initial  node, 
homomorphic  in  the  same  sense  to  a  complete  G-graph.  That  is,  the  complete 
G- graph  is  obtained  by  deleting  the  initial  node  and  production  and  replacing 
each  label  (pXq)  by  X.  Therefore  if  T^  and  T^  are  complete  H- graphs,  with  the 
same  terminal  word,  then  there  is  a  one-to-one  function  h  from  the  noninitial 
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nodes  of  onto  the  noninitial  nodes  of  Tg  such  that  h(P)  is  a  production  of 
Tg  whenever  P  is  a  production  of  T^,  and  such  that  for  every  noninitial  node  s 
of  if  to(s)  -  (pXq.)  then  cp(h(s))  -  (p^q^),  and  if  <p(s)  ■  a  where  a  is  an 
external  letter  then  (o(h(s))  «=■  a.  In  the  former  case  we  have  that  the  terminal, 
word  of  T,  is  w„w„v_  where  (pXq)  |-=  v0,  and  w,  is  (q  ,p)-connected  (with  the 

Jl  jl.  d  j  ad  J.  O 

convention  that  c  is  (r,  reconnected  for  any  r),  and  w^  is  (q,  reconnected  for 
some  final  state  r.  The  terminal  word  of  is  also  w^w^,  and  (p^Xq^  !-  w^ 
(we  rely,  of  course,  on  the  unambiguity  modulo  R  of  G  for  the  latter  statement). 
Since  w  has  been  ascertained  from  to  be  (p,q)-connected,  and  since  A  is 
deterministic,  ve  must  have  p^  =  p  and  q^  c  q.  Therefore  h  must  be  a 
congruence  from  T^  to  T^  when  extended  to  the  initial  node.  Therefore  H  is 
unambiguous.  This  completes  the  proof. 

For  each  context  limited  grammar  0  let  the  numbers  1^(G)  and  12(g)  be 
defined  as  follows. 

First,  let  the  length  X(ty  -*  0)  of  a  production  at  -  0  of  G  be  lol  -  1, 
where  |  or|  is  the  length  of  at. 

Call  the  production  at  -*  £  of  G  a  splitting  production  of  G  if  0  contains 
an  external  letter.  Otherwise  call  at  —  0  a  nonsplitting  production.  Let 

1^(G)  ■  E(X(P)  :  P  a  nonsplitting  production  of  G) 

1^(G)  ■  EfX(p)  :  P  a  splitting  production  of  0) 

Clearly  G  is  context  free  Just  in  case  1^(G)  »  12(G)  -  0. 

Call  a  production  P  context  bound  if  X(P)  >  0. 

The  program  is  to  reduce  12(g)  if  it  is  not  0,  and,  if  1,(G)  -  0  <  l^G), 
then  to  reduce  1^(G).  The  latter  reduction  will  be  considered  next. 
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PROPOSITION  3*4.  Let  G  be  a  context  limited  grammar  with  1^(G)  >  0  and 
1  (G)  ■  0.  Let  R  be  a  regular  set.  Then  there  exists  a  context  limited 
grammar  H  satisfying  the  following: 

(i)  i^(h)  <  i1(g) . 

(ii)  L(H)  -  L(G). 

(iii)  If  G  is  unambiguous  modulo  R  then  H  is  unambiguous  modulo  R. 

Proof.  By  Proposition  3*2,  we  can  assume  that  the  partial  ordering  on  the 
alphabet  of  G  is  a  linear  ordering.  Then  every  set  of  letters  has  a  highest 
member,  where  X  higher  than  Y  means  Y  lower  than  X. 

We  first  want  to  identify  the  set  of  context  free  letters  of  G.  These 
are  the  letters  such  that,  once  one  of  them  appears  in  a  derivation,  it  must 
give  rise  to  a  sequence  of  context  free  productions  culminating  in  an  external 
letter.  Thus  a  context  free  letter  constitutes  a  barrier  that  splits  the 
derivation  into  two  independent  parts.  The  set  of  context  free  letters  is  the 
union  of  the  sets  which  are  defined  inductively  as  follows: 

C  is  the  set  of  external  letters, 
o 

Given  C^,  X  is  in  C^+1  if  both  of  the  following  hold: 

(i)  X  is  not  in  o-  for  any  context  bound  production  a  -*  3. 

(ii)  For  each  production  X  -*  P^Ydg,  where  Y  is  the  highest  letter 
in  01YB2,  Y  is  in  C^. 

Note  that  c  C1+^  for  each  i. 

We  define  the  height  of  a  context  free  letter  X  to  be  the  least  i  such 
that  X  is  in  C, . 
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For  each  context  free  letter  X  we  define  the  term  tree  of  X  inductively 
ns  follows.  If  X  is  an  external  letter,  then  a  tree  of  X  is  a  labeled  production 
graph  with  a  single  node  labeled  X.  Once  we  have  defined  the  trees  of  X  for 
all  X  of  height  at  most  n,  we  extend  the  definition  to  letters  of  height  n  +  1 
as  follows.  Let  X  be  a  context  free  letter  of  height  n  +  1  and  let  X  -*  o^Yov 
be  a  prodiiCtion  of  G  where  Y  is  the  highest  letter  of  o^Yo^,  and  Y  is  not  in 
Then  Y  is  context  free  of  height  at  moBt  n.  Let  p  be  a  node  and  let  cr^qap 
a  string  of  nodes  such  that  cp(p)  =  X,  rr(a^)  ■  cp(q)  ■  Y,  w(a,^)  n  o^.  Let  T 

be  a  tree  of  Y  with  initial  string  q  and  no  other  nodes  in  common  with 
(PjC^qa,  ).  (Trees  of  X  are  production  graphs,  of  course.)  Then 
({(Pi^qag)  },q)  +  T  is  a  tree  of  X. 

It  is  clear  that  if  p  is  a  node  of  a  complete  G-graph  T  with  cp( p)  ■  X, 
where  X  is  a  context  free  letter,  then  p  is  the  initial  node  of  a  subgraph  of 
T,  which  is  a  tree  of  X.  For  all  of  the  necessary  productions  are  there,  and 
p  is  a  string  of  T. 

Let  us  call  a  letter  context  bound  if  it  is  not  context  free. 

There  exists  a  context  bound  nonsplitting  production  a  -•  8  such  that  some 
letter  in  9  is  context  free.  For,  suppose  there  were  no  such  production.  We 
will  show  that,  given  any  context  bound  letter  X,  there  exists  another  context 
bound  letter  Y  that  is  higher  than  X.  Either  X  belongs  to  a  context  bound 
production  o^Xo^  "*  in  which  case  the  highest  letter  Y  of  9  is  higher  than  X 
and  by  hypothesis  context  bound,  or  else  there  is  a  production  X  ■*  0  in  which 
the  highest  letter  Y  of  8,  again  higher  than  X,  is  context  bound.  Hence  G 
could  not  contain  any  context  bound  letter  lest  there  be  an  infinite  ascending 
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sequence  of  letters,  impossible  in  a  partial  ordering  on  a  finite  set.  But  G 
does  contain  a  context  bound  letter  because  1^(G)  is  positive,  and  therefore 
there  is  at  least  one  context  bound  nonsplitting  production  a  -  3,  «nd  our 
hypothesis  has  it  that  the  highest  letter  in  8  is  context  bound.  Hence  the 
hypothesis  is  false. 

That  is,  G  has  a  nonsplitting  production 
o'  -  81XB2, 

where  | or |  >1,  X  is  not  in  B  ,  and  X  is  context  free. 

Let  be  ttie  Bet  811  terminal  words  of  trees  of  X.  Note  that 

all  the  are  distinct  even  though  two  incongruent  trees  of  X  may  have  the 
same  terminal  word. 

We  define  the  grammar  H  to  be  Just  like  G  except  that  H  lackB  the 
production  a  -  R^XBg  and  has  instead  all  the  productions 
a  -  P1V102,  1  *  i  *  k. 

Each  contains  an  external  letter,  bo  each  production  a  -  BlV2  10  a 
splitting  production,  hence  contributes  nothing  to  1^(H).  ®  -•  8jXB2  contributes 

at  least  1  to  1^(G)  and  is  absent  from  H.  Therefore  1^(H)  <  1^(G). 

Let  T  «  ( $, u.)  be  a  G-graph.  Let  T^  ■  ($^,o)  be  a  subgraph  of  T.  Call  T^ 
a  removable  subgraph  of  T  if  T^  ■  (o^p-r^)  +  T^  where  T^  is  a  tree  of  X  with 
initial  node  p,  cp(o)  -  a,  cpC*^)  «  B^,  cp(p)  ■  X,  cp( t^)  ■  B2*  We  are  interested 
in  forming  an  H-graph  by  replacing  all  removable  subgraphs  by  productions,  so, 
recalling  2.1.21,  we  are  interested  in  the  separability  of  two  distinct 


removable  subgraphs. 
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To  establish  this  separability,  we  need  to  show  that  for  every  context 
free  letter  Y,  two  distinct  trees  of  Y  which  are  subgraphs  of  T  have  no 
productions  in  common.  We  proceed  by  induction  on  the  height  of  Y.  If  Y  haB 
height  0,  then  Y  is  an  external  letter,  hence  no  tree  of  Y  has  any  productions 
at  all,  so  the  assertion  holds  in  this  case.  Suppose  the  assertion  holds  for 
heights  at  most  n,  and  let  Y  have  height  n  +  1.  Let  and  be  trees  of  Y 
and  subgraphs  of  T.  Let  ■  (ppalqiCTP  +  where  has  initial  node 
and  is  a  tree  of  cp(q]L).  Let  +  u2  wbere  U2  ha8  node 

q2  and  is  a  tree  of  cp^).  If  the  initial  productions  of  and  T 2  are  the 
same  tnen  q^  -  q^  because  these  nodes  correspond  to  the  leftmost  occurrence  of 
the  highest  letter  of  the  word.  If  cp(q^)  has  height  then  and  U2  have  no 
productions,  in  which  case  »  1^.  If  cp(q^)  has  positive  height,  then  and 
U2  have  initial  productions  with  q1  in  the  first  string  of  each.  Since  T  can 
have  only  one  production  with  q1  in  its  first  string  (2.1.2),  by  inductive 
hypothesis  and  U2  are  identical.  Hence  and  T2  are  identical. 

Suppose  the  initial  production  of  is  in  U2»  To  remove  this  possibility, 
let  us  show  that  if  V  is  a  tree  of  a  context  free  letter  Z  and  (p,o)  is  a 
noninitial  production  of  V,  then  cp(p)  has  height  less  than  the  height  of  Z. 

The  case  of  height  0  is  trivial,  and  if  the  assertion  holds  for  heights  up  to  n, 
and  Z  has  height  n  +  1,  then  V  ■  (q, t)  +  V*  where  Vf  is  a  tree  of  a  letter  z' 
of  smaller  height  than  Z.  (p,o)  is  either  the  initial  production  of  V ' ,  in 
which  case  cp(p)  ■  z'  has  smaller  height  than  Z,  or  else  (p,o)  is  a  noninitial 
production  of  V #,  in  which  case  cp(p)  has  smaller  height  than  Z ' ,  hence  smaller 
height  than  Z.  Thus  if  the  initial  production  of  T^  is  in  we  are  forced  to 
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the  conclusion  that  Y  has  smaller  height  than  Y.  For  the  same  reason,  the 
initial  production  of  is  not  in  U^. 

With  these  facts  in  mind,  suppose  (p,c)  is  a  production  belonging  to  both 
T^  and  T^,  with  cp(p)  having  the  largest  possible  height  consistent  with  this 
property.  (p,o)  is  not  the  initial  production  of  either  or  T^.  Hence 
contains  a  production  (p/,o/)  with  p  in  a*  and  cp(p  /)  of  greater  height  than 
co(p)  *  But  p  cannot  appear  in  the  second  strings  of  two  distinct  productions, 
so  T2  must  also  have  the  production  (p/,o/)*  This  contradicts  the  maxinal 
choice  of  cp(p). 

We  conclude  that  two  trees  of  Y,  subgraphs  of  T,  are  either  identical  or 
share  no  productions. 

Now  we  can  show  that  two  distinct  removable  subgraphs  have  no  productions 
in  common.  Let  arvl  be  removable  subgraphs  of  T.  We  can  say 

T1  "  ^al,TlPlTl^  +  and  T2  -  (v2>T2p2TP  +  U2 
where  'p(o1)  ■  w(a2)  «  a,  cp(t^)  -  cp( t^)  -  B^,  cp(p1)  -  co(p2)  -  X, 
cp(r^)  cp(t")  ■  d2,  and  and  U2  are  trees  of  X  with  initial  nodes  p^  and  p2 
respectively.  If  p^  -  p2,  then  and  Ug  must  have  the  same  initial  production, 
since  there  is  only  one  production  with  p^  in  its  first  Btring.  Hence  and 
U2  are  identical  by  the  assertion  proved  previously.  Also,  if  p^  »  p2,  since 
p^  can  be  in  the  second  string  of  only  one  production, 

(al,TlPlTl)  "  (ct2^T2P2T2)* 

Therefore  to  have  T^  and  T2  distinct, we  need  p1  /  p2* 

(°1,  TlPlTl)  ^  (a2,T2p2T"/)  tecQUBe  Bua^L  P2  BTe  required  to  correspond  to 
the  leftmost  occurrence  of  X  in  cp^p^T^)  and  cp(t^p2t2)  respectively,  so  that 
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the  equality  of  the  two  strings  would  entail  -  p2.  Neither  initial  produc¬ 
tion  of  one  is  a  nonlnitial  production  of  the  other,  for  both  initial  productions 
are  context  bound  and  all  noninitial  productions  are  context  free.  and  IT 
are  distinct  trees  of  X  and  therefore  have  no  productions  in  common.  Therefore 
and  T2  have  no  productions  in  common. 

has  initial  string  o^  and  Just  one  initial  production  (ct1,t^p1t2), 
hence  no  initial  node  of  T^  is  terminal.  Similarly,  no  initial  node  of  T2  is 
terminal,  and  T2  has  only  one  initial  production.  Therefore,  by  2.1.22, 

T^  and  T2  are  separable. 

Now  in  a  complete  0- graph, every  production  (a, t)  with  label  a  -  &^X02 
must  belong  to  a  removable  subgraph,  because  the  node  labeled  X  must  be  the 
initial  node  of  a  tree  of  X,  as  noted  earlier,  and  a  is  a  string  of  T  by  2.1.18. 
Therefore  if  we  replace  all  the  removable  subgraphs  with  productions  of  H,  we 
will  have  an  H- graph. 

More  precisely,  let  T^, . .  .,T^  be  all  the  removable  subgraphs  of  T,  with 

initial  strings  ct^, ...,0^  and  terminal  strings  T^,...,Tk.  For  each  J  the 

terminal  word  of  Tj  is  cp(Tj)  -  3^^  Bg*  the  production  graph  Uj  ue 

J 

(f(aVTj)  W*  Eacb  Uj  1,8  0X1  H”8 raph,  for  the  only  production  label  in  Uj  is 
a  -*  32<  No  Uj  has  any  noninitial  nonterminal  nodes,  so  the  Uj  contain 

no  noninitial  nonterminal  nodes  in  common  with  each  other  or  with  T.  Therefor* 

T^,  ...,Tk  are  replaceable  by  ...,Uk  in  T. 

Denote  by  F(T)  the  graph  formed  by  replacing  T^,  ...,Tk  with  U^,  ...,U^  in  T. 

By  2.1.16  and  2.1.21,  F(T)  has  the  same  initial  and  terminal  strings  as  T, 
and  by  definition  its  labeling  agrees  with  that  of  T  on  the  initial  and  terminal 
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nodes.  Hence  F(T)  has  the  same  initial  and  terminal  wards  as  T.  F(T)  is  an 
H- graph. 

We  note  in  passing  that  this  shows  that  L(G)  cL(h). 

Suppose  h  is  a  congruence  defined  on  T.  Let  us  show  that  F(h(T))  is 

congruent  to  F(T).  h^), . .  .,h(Tk)  are  all  the  removable  subgraphs  of  h(l  , 

their  initial  strings  are  h(o1), . .  .,h(ak)  respectively,  and  their  terminal 

strings  are  h^), .  . .,h(rk)  respectively.  Clearly  (  [(ct^  t^)  },0j)  is  congruent 

to  (f(h(a,),h(Tj))  },h(aj)).  That  is,  Uj  is  congruent  to  h(Uj).  F(h(T))  is  the 

graph  formed  by  replacing  h^), .  ..,h(T^)  by  h(U^),  ...,h(Uj)  in  h(T).  We  have 

the  situation  described  in  2.2.3*  Therefore  F(T)  is  congruent  to  F(h(T)). 

Conversely,  let  T  be  a  complete  H-graph.  Let  (o^, t^), . . ., (ak, t^)  be  all 

of  the  productions  of  T  having  labels  of  the  farm  0/  -*  B^y.^.  That  is, 

cp(a.)  *  cr,  «j(t.)  «  B-,y.  0O.  For  each  j  construct  the  graph 
J  0  -L  <- 

Tj  *  (f(°j'p#jqJpj)kaj)  +  V 

where  rp(p')  ■  B^,  cp( q.^ )  -X,  <p(p  )  ■  Bg,  Vj  is  a  tree  of  X  with  initial  string 

q  ,  and  terminal  word  y  .  Moreover,  we  choose  p  *,  p"  and  the  terminal  string 
J  J  J 

of  V.  w>o  that  the  terminal  string  of  T.  is  t,.  We  choose  the  rest  of  the  nodes 
J  J  J 

of  the  V,  to  be  distinct  from  all  the  nodes  of  T  and  so  that  no  two  V,  have 
nodes  in  common  other  than  initial  or  terminal  nodes. 

Any  two  productions  and  considered  as  subgraphs,  are 

separable . 

Therefore  (cr^, t^),  ...,(ok,Tk),  considered  as  graphs,  are  replaceable  by 
T^...,^  in  T.  Let  l'  be  the  graph  formed  by  the  replacement  of  the  former 
by  the  latter  in  T.  Tf  is  a  0-graph  because  all  production  labels  of  the  form 
ex  -*  B^.Yi®2  ilttve  been  remov'e<i*  T*  has  the  same  initial  and  terminal  words  as  T. 
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(PI)  X  -  Bjb, 

(P2)  or  -  c02. 

The  nonsplitting  productions  of  H  are  exactly  the  nonsplitting  productions 
of  G,  so  l-j^H)  *  l^G).  The  splitting  productions  of  H  are  Just  those  of  G 
except  that  H  has  o'  -  cBg  and  X  -  B^b  instead  of  Xa  -  B^aBg.  length  of 

X  -•  B-jb  is  0,  and  the  length  of  a  -•  cBg  is  -  1,  whereas  the  length  of 
X'Y  -  ^aBg  is  I o |*  Hence  12(h)  <  12(G). 

Let  V  be  the  alphabet  of  G.  Let 
r'  =  V*  U  (V*  be  V*)*. 

That  is,  R '  is  the  set  of  all  words  on  V  U  fb,c)  such  that  b  and  c  occur  only 
in  subwords  be.  Clearly  R#  is  regular. 

Let  M  be  a  sequential  machine  which  replaces  any  Y  /  b,c  by  Y,  and  replaces 
be  by  a.  Let 

r2  «  m"1(r1)  Hr'. 

M-1(R1)  is  regular,  as  is  proved  in  [5,  p.l86],  and  it  is  well  known  that 
the  intersection  of  two  regular  sets  is  regular.  Therefore  R2  is  regular. 

We  will  use,  merely  as  a  notational  device,  the  grammar  h'  which  is  Just 
like  H  except  that  H*  has  the  production  be  a.  We  also  define  a  graph  U 
corresponding  to  the  productions  X  -•  B^b,  a  —  cBg,  be  -*  a.  More  precisely, 

U  *  (fPu>TuqtP'  (aU,rUPlP'  ^qUrU’*U^,Pll°lP 
where  wtpy)  -  X,  co(ay)  -  a,  cp(Ty)  -  B^  «p(qy)  -  b,  cp(Py)  -  Bg,  cp(ry)  -  c, 
coCsy)  -  a.  U  is  an  H '-graph. 

I  claim  that  if  an  H  '-graph  T  such  that  b  and  c  do  not  occur  in  the  initial 
word  of  T  has  a  production  (qr,s)  with  cp(q)  -  b,  cp(r)  ■  c,  cp(s)  -  a,  then  the 
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Let  us  note  in  passing  that  this  shows  L(H)  cl(g),  Therefore,  since  we 
have  the  other  inclusion,  L(H)  -  L(G). 

T.,  . ..,Tk  re  all  of  the  removable  subgraphs  of  T,  far  the  production 
label  a  -  0^X82  can  be  in  T*  only  by  being  in  one  of  the  T^.  The  Tj  have 
initial  strings  and  terminal  strings  t^.  Therefore  it  i6  clear  that 
F(t')  «  T. 

Now  let  T^  and  T^  be  two  complete  H-graphs  with  the  same  terminal  ward  w 
where  w  is  in  R.  There  exist  two  complete  0- graphs, T^  and  T^,with  terminal 
word  w,  such  that  F(T^)  =  T^  and  F(T^)  ■  T^.  If  G  is  unambiguous  modulo  R,then 
T^  and  T^  must  be  congruent.  We  have  seen  that  this  implies  that  F(T^)  is  congruent 
to  F(T^). 

This  establishes  (iii)  and  finishes  the  proof. 

PROPOSITION  3*5 •  Let  G  be  a  context  limited  grammar.  Let  1^(G)  >  0.  Let  R.„ 

be  a  regular  set.  There  exists  a  context  limited  grammar  H,  a  sequential 

machine  [3,  p«93l  M, and  a  regular  set  R^  satisfying  the  following: 

(i)  l(g)  n  r1  -  m(l(h)  n  r2). 

(il)  If  G  is  unambiguous  modulo  R^  then  H  is  unambiguous  modulo  R^. 

(ill)  If  G  is  unambiguous  modulo  Rp  then  M  is  one-to-one  on 
L(H)  0  Rg. 

(iv)  l^H)  -  l^G)  and  lg(H)  <  12(g). 

Proof.  Since  12(G)  >  0,  G  has  a  splitting  production  Xar  -*  P^aP2  where  a  is 
an  external  letter  and  |orJ  >0. 

Let  b  and  c  be  letters  not  in  the  alphabet  of  G.  Let  H  be  exactly  like  G 
except  that  H  has  b  and  c  in  its  external  alphabet,  and  H  lacks  the  production 
Xo  -  and  has  instead  the  productions 
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production  (qr,s)  belongs  to  one  and  only  one  subgraph  of  T  which  is  congruent 
to  U.  For,  q  cannot  be  an  initial  node  of  T,  so  q  must  be  in  the  second  string 
of  some  production  of  T.  Such  a  production  must  have  the  form  (p,  rq)  with 
•s(p)  ■  X,  cp(t)  -  0^.  Similarly,  there  must  be  a  production  (o,rp)  with 
cp( ct)  -  cr,  cp(p)  -  Bg.  Now 

Tx  -  (f(p,Tq),  (o,rp),  (qr,s)  ),po) 

is  a  labeled  graph  congruent  to  U.  We  could  not  find  another  such  congruent 
image  of  U  lest  p  or  q  appear  in  the  second  strings  of  two  distinct  productions 
of  T.  is  a  subgraph  of  T  provided  its  terminal  string  TSp  is  a  string  of  T 
(2.1.15).  Tq,  rp,  and  qr  are  strings  of  T  because  they  are  strings  of 
productions  of  T,  hence  by  two  applications  of  2.1.9  we  learn  that  rqro  is  a 
string  of  T.  (qr,s)  being  a  production,  rsp  must  also  be  a  string  of  T,  hence 
is  a  subgraph  of  T. 

Also,  if  Tp  is  a  congruent  image  of  U  and  a  subgraph  of  T,  and  if  Tg  is 
distinct  from  T^,  then  and  T2  are  separable.  To  see  this,  let  V  be  the 
smallest  initial  subgraph  of  T  cc.  taining  all  the  productions  of  T^.  (For 
the  existence  of  V,  see  2.1.17.)  Then  the  terminal  productions  of  V  are  Just 
the  terminal  productions  of  T^.  Nov  all  the  productions  of  T^  are  terminal 
in  T^.  Therefore  the  terminal  productions  of  V  are  Just  the  productions  of  T^. 
Removing  Just  these  productions  yields  the  initial  subgraph  of  T,  which  i'j 
called  V  -  T^.  Now  T^  and  Tg  have  no  productions  in  common.  For,  if  they  did, 
then  either  q  or  r  would  have  to  be  a  node  of  Tg,  since  every  production  of  T^ 
contains  either  q  or  r.  So  Tg,  being  congruent  to  U,  would  have  to  contain  a 
production  with  q  or  r  in  the  first  string.  There  is  only  one  such  production 
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In  T,  namely  (qr,s) .  We  have  already  seen  that  i6  the  only  subgraph  of  T 
congruent  to  U  which  contains  (qr,s),  and  therefore  and  have  no  produc¬ 
tions  in  cccrnon.  Therefor^  either  V  -  contains  all  the  productions  of 
or  none  of  them.  In  the  former  case,  V  -  T^  establishes  the  asserted 
separability;  in  the  latter  case,  V  establishes  it. 

Let  T7  be  an  H7-graph,  where  b  and  c  occur  neither  in  the  initial  nor  in 
the  terminal  word  of  T#.  Let  T^, ...,Tk  be  all  the  subgraphs  of  T7  which  are 
congruent  to  U.  If  (ct, t)  is  a  production  of  T7  and  cp(a)  -*  cd(t)  is  not  in  G, 
then  (a,  t)  must  belong  to  one  of  the  T^.  For,  if  ro(a)  -*  cp(t)  is  not  in  G, 
then  o(ct)  or  cp(t)  contains  b  or  c.  If,  for  example,  r  =  t7q  with  cp(q)  ■ 
then,  since  b  is  not  in  the  terminal  word  of  T7,  there  must  be  a  production 
(qr,s)  with  cp(r)  »  c,  cp( s )  »  a.  (qr,s)  belongs  to  exactly  one  of  the  T^  and 
that  contains  a  production  with  q  occurring  in  the  second  string.  Since 
there  is  only  one  production  with  q  in  its  second  string,  (a,T7q)  =  (a, t)  must 
be  in  T^  The  other  cases  are  similar. 

It  follows  that  if  we  replace  Tp  ...,Tk  by  G- graphs  U^,  ...,Uk  in  T7,  then 
the  resulting  graph  will  be  a  0- graph.  Letting  cr^,  and  be  respectively  the 
initial  and  terminal  strings  of  for  each  1,  then  1,^)  is  a 

G-graph  for  each  i,  for  the  only  production  label  of  is  a  -  ^a^2*  ®ie 
graph  formed  by  replacing  T^,  . . . ,  T^  by  U^, . . . , in  T7  (2.1.21)  is  a  G-graph 
with  the  same  initial  and  terminal  wards  as  T7.  Call  this  G-graph  F  7(T 7) . 

Conversely,  given  a  G-graph  T  we  can  construct  an  H7-graph  T7  such  that 
T  »  F7(T7).  For,  let  (a^,^), .. .,(0^,^)  be  all  the  productions  of  T  such  that 
cp( )  -  Xar,  cp( )  ■  B^aBg.  These  productions,  considered  as  graphs,  constitute 
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separable  subgraphs  of  T,  that  is  and  (°j> Tj)  considered  as  graphs  are 

separable  if  i  ^  J.  With  the  usual  precautions  about  distinctness  of  new  nodes, 
we  construct  graphs  T^  ...,Tk  all  congruent  to  U,  where  each  has  initial 
string  a ^  and  terminal  string  t^.  The  graph  formed  by  replacing 
(oi, Ti),...,(ak^Tk)  by  T1,...,Tk  in  T  is  an  H '-graph  t'.  The  only  subgraphs 
of  T '  congruent  to  U  are  Tp...,Tk»  t'  has,  by  2.1.21,  the  same  initial  and 
terminal  words  as  T,  hence  t'  contains  neither  b  nor  c  in  its  initial  or 

terminal  word,  because  b  and  c  are  not  in  the  alphabet  of  0.  It  is  clear  by 

inspection  of  the  construction  of  F#  that  F'(t')  -  T.  By  2.2.3, 
it  is  clear  from  this  construction  that  if  f'(T')  is  also  T,  then  T*  is 
congruent  to  l'. 

Fran  this  It  follows  that  L(h')  ■  L(0). 

Suppose  that  G  is  unambiguous  modulo  the  regular  Bet  R^.  Then  h'  is 
unambiguous  modulo  R^.  For,  suppose  T^  and  Tg  are  tvo  H '-graphs  with  the 

same  terminal,  word  w  where  w  is  in  R^,  and  where  b  and  c  do  not  occur  in  the 

initial  or  terminal  word  of  either.  If  f'(T^)  is  congruent  to  f'(T2),  then  T^ 
is  congruent  to  Tg.  If  T^  and  Tg  are  complete  H '-graphs,  then  f'Ct^)  and 
F'(Tg)  are  complete  G- graphs  and  hence  must  be  congruent.  Therefore  h'  Is 
unambiguous  modulo  R^. 

Now  let  us  show  that  L(h')  ■  M(L(H)  Hr').  Let  T  be  an  H- graph  with 
terminal  word  in  r'.  There  exist  k  disjoint  terminal  subwords  p^q^, 
with  ■  b,  tp(q^)  -  c  for  each  i,  and  such  that  any  node  r  with  cp(r )  -  b,c 

is  one  of  the  p^  or  q^.  T  is  an  H  '-graph,  and  if  we  adjoin  new  nodes 
r^,  ...,rk,  new  productions  (p^q^,r^),  such  that  cp(r^)  -  a,  the  resulting  graph 
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T/  is  an  H7-graph.  Define  T7  ■  F(T).  If  the  terminal  word  of  T  is  w,  then  the 
terminal  word  of  T7  is  M(w).  Thus  M(l(h)  OR7)  c  L(H7).  Conversely,  any 
H7-graph  without  b,c  in  the  terminal  word  contains  k  i  0  terminal  productions 

(plqLl,rl),'**,(pkqk,rk)  with  "  b>  “  c>  ■  a*  Deleting  these 

productions  gives  an  H-graph  with  terminal  word  w  in  R  7,  and  M(w)  is  the 
terminal  word  of  T7.  So  L(H7)  -  M(L(H)  Or'). 

Now  we  can  verify  (i).  L(G)  0  R^  ■  M(l(h)  0  R  7)  fl  R^.  If  a  word  w  is  in 
M(L(H)  0  R2),then  w  -  M(v)  for  seme  v  in  L(H)  PI  R2»  L(H)  0  R2  -  L(h)  P  M-1^)  Pi  r', 
so  M(v)  ■  w  is  in  R^.  v  is  al6o  in  I (H)  0  R7  so  M(v)  is  in  M(L(H)  Or').  Thus 
w  is  in  M(L(E)  Hr')  n  R^.  Conversely,  if  w  is  in  M(L(h)  0  r')  fl  then 
w  »  M(v)  for  some  v  in  L(h)  Or'.  Since  M(v)  is  in  R1,  v  is  also  in  M**^(R. ) . 

So  v  is  in  L(H)  0  R  '  0  R1  =  L(H)  0  R  .  So  M(v)  -  w  is  in  M(L(H)  PI  R  ).  Thus 
M(L(H)  Hr7)  n  R  »  M(l(h)  n  R2),  and  this  with  the  previous  relation  implies 

l(g)  n  rx  -  m(l(h)  n  r2). 

To  verify  (ii),  we  already  know  that  if  G  is  unambiguous  modulo  Rp  then 
so  is  H7.  We  want  to  conclude  that  H  is  unambiguous  modulo  M~^(R^)  PI  R*.  Let 
T^  and  T2  be  complete  H-graphs  1th  the  same  terminal  word  w  where  w  is  in  R  7 
and  M(w)  is  in  R^.  F(T^)  and  F(T2)  are  complete  H7- graphs,  each  with  terminal 

word  M(w).  Since  H7  is  unambiguous  modulo  R^,  F(T^)  is  congruent  to  F(T2). 

But  T^  and  T2  are  formed  from  F(T^)  and  F(Tg)  respectively  by  deleting 
productions  that  correspond  in  the  congruence.  Therefore  T^  is  congruent  to 
T2«  Therefore  H  is  unambiguous  modulo  Rg. 

To  verify  (iii),  suppose  G  is  unambiguous  modulo  R^,  and  let  and  w2 
be  two  words  in  L(H)  PI  Rg.  We  want  to  prove  that  M(v^)  ^  M(w2)  if  w^  ^  Wg. 
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Let  Tj.  and  be  complete  H-graphs  with  terminal  words  and  w^  respectively. 
Then  F(T^)  and  F(T2)  are  complete  H-graphs  with  terminal  words  M^)  and  M(w2) 
respectively.  Since  w^  and  w2  are  in  Rg  ■  M“i(R^)  fl  r',  M(w^)  and  M(w2)  are 
in  R, .  If  M^)  ■  M(w2),  then  by  the  unambiguity  modulo  R^  of  H*,  F^)  is 
congruent  to  F(Tg).  We  noted  before  that  this  entails  being  congruent  to  T,,. 
But  that  would  require  w^  ■  \r^.  Therefore  M  is  one-to-one  on  L(H)  0  Rg. 

This  finishes  the  proof. 

PROPOSITION  3.6.  Let  G  be  a  context  limited  grammar  and  R  a  regular  set.  Then 
L(G)  0  R  is  a  context  free  language.  Moreover,  if  G  is  unambiguous  modulo  R, 
then  L(G)  OR  is  unambiguous  aB  a  context  free  language. 

Proof.  Suppose  there  is  a  c ounter example .  Let  x  be  the  smallest  integer  for 
which  there  is  a  counterexample  G '  such  that  l^G*)  »  x.  Let  y  be  the  smallest 
integer  for  which  there  is  a  counterexample  G  with  1^(G)  ■  x  and  12(G)  ■  y. 

Suppose  y  >  0.  Let  R^  be  a  regular  set.  G  and  R^  satisfy  the  hypothesis 
of  3*5«  So  let  M,  H  and  R0  be  as  in  the  conclusion  of  3.5.  Then  12(H)  <  y 
and  L^H)  «=  x,  so  L(H)  0  R2  is  a  context  free  language.  Sequential  machine 
mappings  preserve  context  freedom  [5,  p.l84],  so  M(L(H)  0  r  )  is  context  free. 
Thus  L(G)  0  R1  is  a  context  free  language.  If  G  is  unambiguous  modulo  R  , 
then  H  is  unambiguous  modulo  R£.  Again,  because  H  satisfies  the  present 
proposition,  L(H)  0  Rg  is  an  unambiguous  context  free  language.  Moreover,  M 
is  one-to-one  on  L(H)  H  R^,  so  that  M  produces  an  unambiguous  image  of 
L(H)  H  [6,  Theorem  1],  So  L(G)  0  R^  is  unambiguous.  But  this  would 
that  G  is  not  a  counterexample.  Therefore  y  ■  0. 


mean 
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Suppose  x  >  0.  Let  R  be  a  regular  set.  G  satisfies  the  hypothesis  of 
3.4.  Let  H  be  as  in  the  conclusion  of  3.4.  Then  L^H)  <  x.  So  L(H)  fi  R  is  a 
context  free  language.  But  L(H)  ■  L(G),  so  L(g)  fi  R  is  a  context  free  language. 
Moreover,  if  G  is  unambiguous  modulo  R,  then  H  is  unambiguous  modulo  R,  and 
since  H  is  not  a  counterexample,  L(H)  H  R  must  be  unambiguous,  i.e.,  L(G)  0  R 
is  unambiguous,  and  we  have  contradicted  the  assumption  that  G  is  a  counterexample. 

Therefore  x  and  y  are  both  0.  But  this  means  that  G  is  already  a  context 
free  grammar,  hence  L(G)  0  R  is  a  context  free  language  for  every  regular  set  R, 

for  intersection  with  regular  sets  preserves  context  freedom  [l,  p.171].  By 
3.3,  if  G  is  unambiguous  modulo  R,  then  L(G)  fl  R  is  unambiguous.  Again 

we  have  to  deny  that  G  is  a  counterexample,  so  there  must  not  be  any 

cou nte rexemple  s . 

Letting  R  -  £*  where  E  is  the  external  alphabet  of  G,  ve  have  immediately 
from  3*6,  3«1>  and.  2.2.4: 

THEOREM  1.  The  context  limited  .aiguages  are  (disregarding  c)  Just  the  context 
free  languages. 

THEOREM  2.  The  unambiguous  context  3  united  languages  are  (di  sregarding  e) 

Just  the  unambiguous  context  free  lang^es. 
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1}  ABSTRACT 


A  grammar  G  is  context  limited  if  there  exists  a  partial  ordering  on  the  alphabet 
of  G  under  which,  for  every  product  ion  C^-f^  of  G,  every  letter  of  <\  is  smaller  than 
some  letter  of  $  .  It  is  proved  that  the  languages  generated  by  contect  limited 
grammars  are  just  the  context  free  languages.  Unambiguity  of  general  grammars  is 
defined  and  discussed  carefully,  preparatory  to  proving  that  the  languages 
generated  by  unambiguous  context  limited  grammars  are  just  the  unambiguous 
context  free  languages. 
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